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5 MODULATORS OF TISSUE REGENERATION 

Related Applications 

This is a continuation-in-part of prior U.S. Provisionals S.N. 60/047,490 and S.N. 
60/047,491, both filed May 23, 1997. The teachings of both earlier-filed Provisional patent 
applications are incorporated herein by reference. 

10 Field of the Invention 

The invention relates generally to gene expression products, including nucleic acids 
(e.g., RNAs) and polypeptides that are upregulated in injured or regenerating tissues. 
Further, the invention relates generally to cDNAs and other nucleic acids encoding 
polypeptides that are upregulated in injured or regenerating tissues. 

15 Background of the Invention 

The processes of tissue development during embryogenesis, and of tissue repair 
and/or regeneration following injury or insult are presently the topic of intense 
investigation. Both processes involve a dynamic remodeling of tissue architecture, which 
is triggered and mediated by numerous biological interactions, including cell-cell contact, 
20 cell-matrix contact, release of soluble biological response modifiers, synthesis of structural 
components, and many other changes in cell phenotype, including changes in gene 
expression. Many of the factors involved in tissue genesis and in the response to tissue 
insult remain unknown or poorly understood. 

Several systems have been developed for modeling the events triggered by insult to 
25 particular tissues, and/or by particular types of insult. For example, several investigators 
have described events occurring in mammalian kidney tissue exposed to an ischemia- 
reperfiision insult. The kidney is able to repair damage to the proximal tubule epithelium 
through a complex series of events involving cell death, proliferation of surviving proximal 
tubule epithelial cells, formation of poorly differentiated regenerative epithelium over the 
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denuded basement membrane, and differentiation of the regenerative epithelium to form 
fully functional proximal tubule epithelial cells (Wallin et al., Lab. Invest 66:474-484, 
1992; Witzgall et al., Mol. Cell. Biol. 13:1933-1942, 1994; Ichimuraet al., Am. J. Physiol. 
269:F653-662, 1995; Thadhani et al., N. Engl. J. Med. 334:1448-1460, 1996). Growth 
factors such as IGF, EGF, and HGF have been implicated in this process of repair, as has 
the endothelial cell adhesion molecule ICAM-1. However, the mechanisms by which the 
tubular epithelial cells arc restored, either functionally and morphologically, remain poorly 
understood. 

There accordingly is a need for an improved understanding of the biological 
processes of tissue repair or regeneration. That is, there is a need to identify factors that are 
indicative of the initiation, development and resolution of normal wound healing or normal 
biological response to tissue insult. Similarly, there is a need to identify factors that are 
indicative of pathologies of abnormal responses to tissue insult or other stimulus, including 
autoimmune or other dysregulation pathologies, as well as pathologies associated with 
abnormal tissue growth including neoplastic growth. Further, there is a need to identify 
factors that constitute appropriate targets for therapeutic intervention to stimulate, 
modulate, enhance, suppress or otherwise manipulate biological responses to tissue insult. 
Similarly, there is a need to identify factors that constitute appropriate targets for 
therapeutic intervention to manipulate abnormal or dysregulated responses to tissue insult 
or other stimulus, including abnormal tissue growth associated with cancer (neoplasia) and, 
conversely, abnormal tissue quiescence associated with degenerative diseases. 

Summary of the Invention 

It is an object of this invention to identify factors that are indicative of the initiation, 
development and resolution of normal wound healing or normal biological response to 
tissue insult. A specific object of this invention is to identify products (e.g., RNAs, 
cDNAs, polypeptides) of genes that are upregulated in the context of healing tissue, or in 
tissue exposed to insult or injury. A particular object is to identify products of genes that 
are upregulated in mammalian kidney tissue exposed to insult or injury, such as ischemia- 
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reperfusion injury. A second specific object of this invention is to identify products of 
genes that are upregulated in pathological conditions associated with abnormal responses 
to tissue insult or other stimulus, including autoimmune or other dysregulation pathologies, 
as well as pathologies associated with abnormal tissue growth including neoplastic growth .' 
5 A third specific object is to identify gene products that constitute appropriate targets for 
therapeutic intervention to stimulate, modulate, enhance, suppress or otherwise manipulate 
biological responses to tissue insult. A particular object is to identify gene products for 
therapeutic intervention to manipulate responses of mammalian kidney tissue to tissue 
injury, such as ischemia-reperfusion injury. A fourth specific object is to identify gene 
10 products that constitute appropriate targets for therapeutic intervention to manipulate 
abnormal or dysregulated responses to tissue insult or other stimulus, including abnormal 
tissue growth associated with cancer (neoplasia) and, conversely, abnormal tissue 
quiescence associated with degenerative diseases. Thus, other particular objects of the 
invention include the identification of gene products as targets for therapeutic intervention 
15 in the clinical management (including prophylaxis, maintenance and treatment) of kidney 
diseases, including diseases involving renal failure, and of cancers derived from or 
affecting renal tissue. 

The present invention rests on the discovery that the expression levels of numerous 
genes, including many genes heretofore unknown and/or uncharacterized, are upregulated 

20 in mammalian kidney tissue that has been exposed to tissue insult or injury. More 

specifically, the present invention rests on the discovery of a novel class of Kidney Injury- 
associated Molecules (each member of the class is henceforth called a "KIM*). HMs are 
polypeptides encoded by genes whose expression levels are upregulated in the mammalian 
kidney upon exposure to tissue injury, particularly after exposure to ischemia-reperfusion 

25 injury. Any KIM is useful as an indicator of tissue status (preferably renal tissue status) or 
of a change therein (e.g, exposure to tissue injury, or the occurrence or stage of a tissue 
repair process). Any KIM further is useful as a target for therapeutic intervention to 
manipulate, whether by an agonist or an antagonist, a normal or abnormal tissue response 
to insult or other stimulus. 
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The invention accordingly provides, in a first aspect, purified and isolated nucleic 
acid molecules encoding all or a unique fragment of a KM. In one embodiment, the 
present nucleic acids are RNAs. In another, they are DNA molecules, such as cDNAs. In 
another embodiment, the invention provides the complementary strands of nucleic acids 
encoding all or a unique fragment of a HM. In another embodiment, the invention 
provides nucleic acids (preferably DNAs) that hybridize under low or, preferably, high 
stringency conditions to any of the foregoing nucleic acids. In still another embodiment, 
the invention provides nucleic acids (preferably DNAs) which, but for the degeneracy of 
the genetic code, would hybridize to any of the foregoing nucleic acids. In some 
embodiments, a nucleic acid encoding all or a unique fragment of a KIM is an engineered 
(recombinant) nucleic acid, optionally in operative association with an expression control 
element or other regulatory element. In other embodiments, a nucleic acid encoding all or 
a unique fragment of a KIM is an antisense nucleic acid sufficient, when internalized 
within a cell, to disrupt expression of a cellular KIM gene. 

Specific novel KTMs (and unique fragments thereof) of the present invention are 
coded for by nucleic acids having the sequences disclosed herein in TABLE 1 and in the 
Sequence Listing. Other specific KIMs (and unique fragments thereof) are coded for by 
nucleic acids having sequences that are degenerate variants of any of the KIM sequences 
set forth in TABLE 1 and in the Sequence Listing. Still other specific KIMs (and 
fragments) are coded for by nucleic acids that are substantially similar to (homologous to) 
any of the KIM sequences in TABLE 1 and in the Sequence Listing. Such KIMs are 
defined herein as variants of the disclosed novel KIM sequences. In some embodiments, 
the present nucleic acid encodes a chimeric polypeptide comprising a novel KIM-encoding 
sequence (i.e., a sequence encoding part or all of a KIM) fused to a non-KIM sequence. 
Thus, the invention provides nucleic acids encoding novel KIM fusion proteins, non- 
limiting examples of which include KIM polypeptides fused to a secretable leader 
polypeptide, an immunoglobulin polypeptide, a binding pair partner (e.g., avidin, GST), a 
toxin or toxoid (e.g., ricin, tetanus), an enzyme (preferably one for which a detectable 
substrate is available, e.g., alkaline phosphatase, horseradish peroxidase, luciferase). or 
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other detectable polypeptide (e.g., green fluorescent protein). Additional specific KIMs 
(and unique fragments thereof) are coded for by nucleic acids referred to in TABLE 2. It is 
believed that, in most to all instances, these specific molecules have not heretofore been 
appreciated to be KIMs. 

In a second aspect, the invention provides a vector having a KIM-encoding nucleic 
acid inserted therein. In some embodiments, the vector is a biologically functional plasmid 
or viral DNA vector. In other embodiments, the vector is a retroviral vector. 

In a third aspect, the invention provides a prokaryotic or eukaryotic host cell 
comprising an internalized vector having a KIM-encoding nucleic acid insert. The present 
host cell provides intracellular means for producing (synthesizing, folding, processing, or 
secreting) a KIM polypeptide of the present invention. 

In a fourth aspect, the invention provides a process for the production of a KIM 
polypeptide. The present production process includes growing a host cell of the invention 
under culture conditions sufficient for the production of polypeptides of vector origin, and 
recovering an expressed KIM polypeptide. 

In a fifth aspect, the invention provides a purified and isolated novel KIM 
polypeptide, preferably substantially free of non-KIM polypeptides or proteins. In some 
embodiments, the present polypeptide is a full-length polypeptide, i.e., a polypeptide 
corresponding to the full-length open reading frame of a novel KIM-encoding cDNA. In 
other embodiments, the present polypeptide is a unique fragment of the full-length 
polypeptide. Thus, specific novel KIM polypeptides (and unique fragments thereof) of the 
present invention are coded for by nucleic acids having the sequences disclosed herein in 
TABLE 1 and in the Sequence Listing. Other specific novel KIM polypeptides of the 
present invention are variants of the disclosed novel KIM sequences, including without 
limitation splice variants, truncation variants, and substitution variants. In some 
embodiments, the invention provides a chimeric polypeptide comprising a novel KJM 
polypeptide fused (preferably via a peptide bond) to a non-HM polypeptide. Thus, the 
invention provides novel KIM fusion proteins, exemplified by the above-mentioned fusion 
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constructs. Of course, the invention also provides conjugated or derivatized novel KIM 
polypeptides, including without limitation detectable conjugates, imageable conjugates, 
radiolabeled conjugates, and toxin conjugates. In still other embodiments, the invention 
provides chimeric polypeptides, fusion proteins and conjugates comprising a polypeptide 
appreciated herein as being a KIM (see TABLE 2). 

In a sixth aspect, the invention provides an antibody that binds selectively to a KIM 
polypeptide. Preferably, the antibody is a monoclonal antibody (or an engineered 
derivative thereof) produced by conventional means from a hybridoma derived from 
splenocytes of an animal immunized with a KIM polypeptide of the present invention. In 
some embodiments, the present antibody is conjugated or derivatized with a detectable 
moiety, toxin, imageable compound or radionuclide. 

In a seventh aspect, the invention provides a pharmaceutical composition 
comprising a therapeutically effective amount of a KIM nucleic acid (e.g., an antisense 
nucleic acid); a KIM vector, a KIM polypeptide; a KIM fusion protein; a KIM-binding 
antibody (also referred to as an anti-KIM antibody), dispersed, dissolved or otherwise 
suspended in a physiologically acceptable carrier, vehicle, solvent or excipient. 

m an eighth aspect, the invention provides methods and kits for detecting, either 
qualitatively or quantitatively, KIM expression and/or KIM polypeptide. As disclosed 
herein, KIMs are upregulated by exposure of mammalian kidney (renal) tissue to insult or 
injury, particularly ischemia-reperfusion injury. Thus, any KIM (or combination or panel 
thereof) can be used as an indicator of tissue status (preferably renal tissue status), or of a 
change therein, including without limitation exposure to tissue injury, the occurrence or 
stage of an injury process, the occurrence or stage of a tissue repair or regeneration process, 
or the occurrence of an abnormal tissue response to injury or other stimulus, such as an 
autoimmune response or an abnormal proliferative response (e.g., a neoplastic response). 

In one embodiment, the invention provides a detection method and kit for 
diagnosis, prognosis, staging, or monitoring of renal injury or of renal disease or of the 
effectiveness of therapy therefor. Some specific embodiments involve detecting and/or 
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measuring the concentration of one or more KIM polypeptides in serum, urine, or urine 
sediment of an individual (a mammal, preferably a human) afflicted with or at risk of 
developing renal injury or an impairment of renal function. Some such embodiments 
involve the use of at least one KIM polypeptide, anti-KIM antibody or conjugate thereof as 

5 elements of a reagent kit for immunoassay according to standard techniques. For present 
purposes, any KIM disclosed herein or identified as such herein can be used as a kit 
element, along with other elements for KIM detection. In other instances, such as where 
the detected KIM has biological or enzymatic activity, the kit can include reagents for 
detecting KIM activity, e.g., by enzyme assay. Other specific embodiments involve 

10 detecting and/or measuring the level of expression of one or more KIM-encoding nucleic 
acids in renal cells in a kidney biopsy, or in cells shed into urine or urine sediment of an 
individual afflicted with or at risk of developing renal injury or an impairment of renal 
function. Some such embodiments involve the use of a nucleic acid (e.g., RNA or DNA) 
encoding a KIM polypeptide, or a unique fragment thereof, or a probe nucleic acid capable 

15 of hybridizing to nucleic acid encoding a KIM, as elements of a reagent kit for 

hybridization assay according to standard techniques. As above, any KIM nucleic acid 
disclosed herein or appreciated herein as such can be used as a hybridization kit element, 
along with other reagents for detection of hybridized nucleic acids. 

In another embodiment, the invention provides a method and kit for diagnosis, 
20 prognosis, staging or monitoring of an abnormal response of renal tissue to tissue injury or 
other stimulus, including an autoimmune response or an abnormal proliferative response, 
such as neoplasia arising from or affecting renal tissue. Some specific embodiments 
involve detecting and/or measuring the concentration of one or more KIM polypeptides in 
serum, urine, or urine sediment of an individual (a mammal, preferably a human) afflicted 
25 with or at risk of developing renal injury or an impairment of renal function. Some such 
embodiments involve the use of at least one KIM polypeptide, anti-KIM antibody or 
conjugate thereof as elements of a reagent kit for immunoassay according to standard 
techniques. For present purposes, any KIM disclosed herein or identified as such herein can 
be used as a kit element, along with other elements for KIM detection. In other instances, 
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such as where the detected KIM has biological or enzymatic activity, the kit can include 
reagents for detecting KIM activity, e.g., by enzyme assay. Other specific embodiments 
involve detecting and/or measuring the level of expression of one or more KIM-encoding 
nucleic acids in cells present in a kidney biopsy, or in cells shed into urine or urine 
sediment of an individual afflicted with or at risk of developing renal injury or an 
impairment of renal function. Some such embodiments involve the use of a nucleic acid 
(e.g., RNA or DNA) encoding a KIM polypeptide, or a unique fragment thereof, or a probe 
nucleic acid capable of hybridizing to nucleic acid encoding a KIM, as elements of a 
reagent kit for hybridization assay according to standard techniques. As above, any KIM 
nucleic acid disclosed herein or appreciated herein as such can be used as a hybridization 
kit element, along with other reagents for detection of hybridized nucleic acids. 

In a ninth aspect, the invention provides a method and reagent for imaging tissues, 
either in vitro or in vivo. In particular, the invention provides a method and reagent for 
imaging the presence, extent or severity of kidney injury or of kidney tissue repair or 
regeneration. Similarly, the invention provides a method and reagent for imaging the 
presence, extent, severity or stage of an abnormal response to tissue injury or other 
stimulus, such as autoimmunity or neoplasia, particularly neoplasia arising from or 
affecting renal tissue. The present invention also provides a method for targetting an 
imageable compound to cells or tissue expressing or producing a KIM. The present 
method involves the step of contacting cells or tissue with a detectable (e.g., imageable) 
KIM-binding reagent, which in some embodiments is an anti-KIM antibody or conjugate 
thereof, or a KIM fusion protein. In other embodiments, the detectable KIM-binding 
reagent is a nucleic acid of the invention (e.g., a probe or antisense nucleic acid) labeled 
with a radionuclide or other imageable compound. According to the present method, cells 
expressing or producing a KIM are visualized (imaged) by detecting the presence and/or 
location of an accumulation of the KIM-binding reagent. For imaging in vivo, the KIM- 
binding reagent is administered, by any appropriate route, to an individual (a mammal, 
preferably a human) suspected of harboring an imageable locus of KIM expression and/or 
production. Without being limited hereby, it is believed that the present method can be 
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used to detect the presence and/or location of a tissue mass (e.g., a tumor) abnormally 
producing or expressing a KIM. 

In a tenth aspect, the invention provides methods for treating an individual (a 
mammal, preferably a human) afflicted with or at risk of developing a disease or condition 
contributed to or associated with a dysfunction or dysregulation of a KIM gene or protein. 
Further, the invention provides methods for treating an individual afflicted with or at risk 
of developing a disease or condition beneficially affected by therapeutic aclministration of a 
KIM protein. The present methods involve the step of administering to the individual a 
therapeutically effective amount of a KIM polypeptide, variant or fusion protein thereof, or, 
conversely, of an anti-HM antibody. It is expected that such compounds will be useful in 
therapeutic methods which manipulate, e.g., stimulate or inhibit, biological responses that 
are dependent on KIM function. 

In an eleventh aspect, the invention provides a method for inhibiting the growth of 
KIM-expressing tumor cells, involving the step of contacting the cells with an anti-KIM 
antibody, conjugated to a toxin or radionuclide. In an alternative embodiment, the method 
involves the step of contacting the cells with an antisense KIM nucleic acid that is 
sufficient to suppress or disrupt expression of a KIM gene in the tumor cells. 

In a twelfth aspect, the invention provides a method of gene therapy. The present 
method involves the administration of a vector capable of directing the production of a 
KIM, to an individual afflicted with or at risk of a renal disorder, disease or injury. As a 
result of the present method, growth of new tissue, preferably renal tissue, is stimulated, or 
survival of existing tissue, preferably renal tissue, is promoted. 

The foregoing and other objects, features, aspects and advantages of the present 
invention, as well as the invention itself, will be more fully understood from the following 
description of preferred embodiments. 
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Detailed Descrioti n f the Invent! n 

The discovery, reported herein, of a novel class of Kidney Injury-related Molecules 
(KIMs) was made by analyzing differences in mRNA expression between normal adult 
mammalian kidneys, and kidneys in the process of regenerating following exposure to 
5 tissue insult (specifically, to ischemia-reperfusion injury). Two established techniques 
were used for this purpose: representational difference analysis (RDA), and suppression 
subtractive hybridization (SSH). Both techniques were used to assess cDNAs isolated 
from various timepoints, e.g., 48 hours, after the onset of ischemia. In these studies, the 
normal adult kidney material was isolated from sham-operated subjects. Both techniques 

10 resulted in the depletion of cDNAs which are common to both postischemic and to normal 
kidney samples, leaving a pool of cDNAs which are significantly expressed only in injured 
or regenerating kidney tissue. This pool likely contains cDNAs corresponding to genes 
that encode proteins involved in the injury process, and proteins involved in tissue repair or 
regeneration processes. Therefore, cDNAs isolated from the pool likely encode parts or all 

15 of proteins likely to be therapeutically beneficial for treatment or prophylaxis of tissue 
injury, especially renal injury. Several cDNA clones have been obtained, sequenced and 
characterized. 

Selected Definitions 

A "KIM protein", herein used synonymously with "KIM", is any protein or 

20 polypeptide encoded by mRNA which is selectively upregulated following injury to a 
kidney. One group of KIM proteins of interest includes those coded for by mRNA which 
is selectively upregulated at any time within one week following any insult which results in 
injury to renal tissue. Examples of times at which such upregulation might be identified 
include 10 hours, 24 hours, 48 hours or 96 hours following an insult. Examples of tissue 

25 insults include toxin exposure, hypoxia, hyperoxia, hemodynamic disruption, ischemia, 
reperfusion, or mechanical compression. Many different types of proteins fall within the 
KIM class, including cell surface proteins (e.g., ligands or counter-receptors involved in 
cell-cell or cell-matrix interactions), secreted proteins (e.g., diffusable biological response 
modifiers, such as growth factors, differentiation factors, survival factors and the like), 
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intracellular proteins (e.g., elements of a signalling pathway), and nuclear proteins (e.g., 
transcription factors). 

A "KIM ligand" is any molecule which noncovalently and specifically binds to a 
KIM protein. Such a ligand can be a protein, peptide, steroid, antibody, amino acid 
derivative, or other type molecule, in any form, including naturally-occurring, 
recombinantly produced, or otherwise synthetic. A KIM ligand can be in any form, 
including soluble, membrane-bound, or part of a fusion construct with immunoglobulin, 
fatty acid, or other moieties. The KIM ligand may be an integrin. A membrane-bound 
KIM ligand can act as a receptor which, when bound to or associated with KIM, triggers a 
cellular response. In some interactions, a KIM may associate with a plurality of KIM 
ligands, or may associate with a KIM ligand as part of a complex with one or more other 
molecules or cofactors. In a situation where both the KIM and the KIM ligand are bound 
to cell membranes, the KIM may associate and react with KIM ligand which is bound to 
the same cell as the KIM, or it may associate and react with KIM ligand be bound to a 
second cell. Where the KIM ligation occurs between molecules bound to different cells, the 
two cells may be the same or different with respect to cellular type or origin, phenotypic or 
metabolic condition, or type or degree of cellular response (e.g., growth, differentiation or 
apoptosis) to a given stimulus. "KIM ligation" refers to the contact and binding of KIM 
with a KIM ligand. 

A "unique fragment" of a nucleic acid means any fragment of sufficient length to 
have a sequence likely to be substantially unique in a mammalian genome. Thus, a unique 
fragment generally means an oligonucleotide at least 16 nucleotide bases in length. 
Similarly, a "unique fragment" of a polypeptide means any fragment of sufficient length to 
have an amino acid sequence likely to be substantially unique to a given mammalian 
protein, such as a KIM. Thus, a unique peptide fragment generally means a peptide at least 
7 amino acids in length. 

A "KIM variant" means a KIM whose sequence differs from a sequence disclosed 
herein by the presence of one or more internal or terminal insertions, deletions or 
substitutions of a nucleotide (when referring to KIM nucleic acids) or of an amino acid 
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(when referring to KIM polypeptides). Preferably, the KIM variant is "substantially 
similar" to the corresponding disclosed KIM, or to a unique fragment thereof. That is, the 
KIM variant is "homologous" to the corresponding disclosed KIM. "Substantially 
similar" or "homologous" variants are structurally similar to the corresponding disclosed 
KIM. Further, "substantially similar" or "homologous" variants have sufficient functional 
similarity to the corresponding disclosed KIM that they share one or more of the KIM's 
biological properties or functions (e.g., binding to a receptor or ligand, triggering of 
biological responses, transport of a metabolite, catalysis of a substrate, or the like). A KIM 
variant can be naturally occurring or synthesized or produced by routine techniques, such 
as molecular engineering techniques. 

By "alignment of sequences" is meant the positioning of one sequence, either 
nucleotide or amino acid, with that of another, to allow a comparison of the sequence of 
relevant portions of one with that of the other. Generally, sequences are aligned using the 
GAP and BESTFIT programs, which are based on the teachings of Needleman et al. (J. 
Mol. Biol. 48:443-453, 1970), Smith et al. (Adv. Appl. Math. 2:482-489, 1981), and 
RechUd et al. (CABIOS 5:107-1 13, 1989). Generally, when a homologous variant of a 
KIM is aligned with the corresponding KIM polypeptide, it will share amino acids that 
contribute to the KIM's three dimensional structure, such as cysteine residues. Although 
the relative positions of cysteine residues is generally conserved in the variant sequence, 
homologous or functionally equivalent sequences can include functionally equivalent 
arrangements of the cysteines, including arrangements comprising amino acid insertions or 
deletions which alter the linear arrangement of the cysteines, but do not materially impair 
their relationship or ability to form disulfide bonds in the folded structure of the KIM 
protein. Therefore, minor internal gaps and amino acid insertions, such as occur in splice 
variants, are ignored when aligning sequences herein. 

"Sequence homology", "percent (%) homology", "sequence similarity" and 
"percent (%) similarity" are used interchangeably herein, and refer to the sum of the 
percentage of residues (whether nucleotides or amino acids) that are, when aligned with a 
reference sequence, the same as the corresponding reference residues, and those that are, 
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when aligned, conservative substitutions for the corresponding reference residues. 
"Sequence identity" and "percent (%) identity" also are used interchangeably herein, and 
refer to the percentage of residues (whether nucleotides or amino acids) that are, when 
aligned with a reference sequence, the same as the corresponding reference residues. Both 
sets of terms are used according to their definitions in Altschul et al. (1990), J. Mol. Biol. 
215:403-410 and in the Basic Local Alignment Search Tool (BLAST) algorithm described 
therein. For present purposes, the algorithm gap weight is set at 3.0 and the length weight 
is set at 0.1. 

"Hybridization" means the formation of a duplex nucleic acid molecule, in which 
nucleotide bases of a first polynucleotide strand bind noncovalently with cognate 
nucleotide bases of a second polynucleotide strand. Generally, such noncovalent binding 
occurrs in DNA only between adenosine (A) and thymidine (T) bases, and guanosine (G) 
and cytosine (C) bases. In RNA, binding occurs only between A and uracil (U), and G and 
C. The apposition of non-cognate bases in a duplex nucleic acid molecule, e.g., A with C, 
is termed a "mismatch" pairing. Under high stringency hybridization conditions, 
mismatches occur rarely in a given duplex nucleic acid molecule. Low stringency 
conditions permit the occurrence of some mismatches. Exemplary conditions which 
promote DNA hybridization, termed "hybridization conditions", include 6.0X sodium 
chloride/sodium citrate (SSC) at about 45°C, followed by a wash of 2.0X SSC at 50°C. 
The salt concentration in the wash step can be selected from a low stringency wash of 
about 2.0X SSC at 50°C, to a high stringency was of about 0.2X SSC at 50°C. In addition, 
the temperature in the wash step can be selected from a low stringency wash at room 
temperature, about 22°C, to a high stringency wash at about 65°C. 

A "KIM agonist" is a molecule which can specifically trigger a cellular response 
normally triggered by the interaction of KIM with a KIM ligand. A KIM agonist can be a 
KIM variant, or a specific antibody to KIM, or a soluble form of the KIM ligand. 

A "KIM antagonist" is a molecule which can specifically associate with a KIM 
ligand or a KIM, thereby blocking or otherwise inhibiting KIM binding to the KIM ligand. 
The antagonist binding blocks or inhibits cellular responses which would otherwise be 
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triggered by ligation of the KIM ligand with KIM or with a KIM agonist. Examples of 
KIM antagonists include certain KIM variants, KIM fusion proteins and specific antibodies 
to a KIM ligand or KIM. 

"Isolated" refers to a condition in which a nucleic acid or polypeptide of the present 
invention is essentially free of other nucleic acids, polypeptides, or of other contaminants 
with which it might normally be found in nature, and as such exists in a form not found in 
nature. 

"Substantially pure" refers to a condition in which a nucleic acid or polypeptide of 
the present invention is separated from other nucleic acids, polypeptides, or other 
contaminants, particularly naturally occurring contaminants, that interfere with the ability 
to detect, visualize or isolate the the nucleic acid or polypeptide of the invention, or that 
interfere with a biological function or property thereof. A substantially pure nucleic acid or 
polypeptide of the invention is not generally found in nature. 

A "chemical derivative" of another molecule contains one or more additional 
chemical or biochemical moieties not found naturally in association with the molecule. 
Such moieties may improve the molecule's solubility, absorption, biological half life, etc. 
The moieties may alternatively decrease the toxicity of the molecule, ehminate or attenuate 
any undesirable side effect of the molecule, etc. Moieties capable of mediating such effects 
are disclosed, for example, in Remington's Pha rmaceutical Scjgasgs, 16th ed., Mack 
Publishing Co., Easton, Penn. (1980). 

Other terms used herein, including "antisense DNA", "antisense probe", "cloning", 
"cDNA", "cDNA library", "DNA polymorphism", "expression", "gene", "hybridoma", 
"plasmid", "probe", "labeled", "recombinant", "host cells", "transformed", "transfected", 
"vector", and the like are used in the sense of their art-recognized meanings in the fields of 
cell biology, cell culture, molecular biology, transgenic biology, microbiology, 
recombinant DNA or genetic engineering, and immunology. Such meanings are 
determined by consultation of one or more of the following widely available texts: 
Molecular Clonin p, A Laboratory Manual , 2nd Ed. (Sambrook, Fritsch and Maniatis, eds.), 



14- 



SUBSTTTUTE SHEET (RULE 26) 



WO 98/53071 



PCIYUS98/10547 



Cold Spring Harbor Laboratory Press, 1989; DNA Cloning . Volumes I and H (Glover, ed.), 
198 * Oligonucleotide Synthesis (Gait, ed.), 1984; U.S. Patent 4,683,195, Mullis et al., 
mvs - Nucleic Acid Hybridization (Hames & Higgins, eds.), 1984; Transcription and 
Translation (Hames & Higgins, eds.), 1984; Culture of Animal Telly (Freshney), Alan R. 
Liss, publ., 1987; Immobilized Cells and Enzvmes (IRL Press), 1986; A Practical Guide tr> 
Molecular Cloning (Perbal), 1984; Current Protocols in Molecular Biolopv Wiley & Sons, 
publ., 1989; Methods in EnzvmolopY, Academic Press, New York NY (especially Volumes 
154 and 155); Gene Transfer Vectors for Mammalian Cells (Miller and Calos, eds.), Cold 
Spring Harbor Laboratory Press, 1987; Immunochemical Methods in Cell and Molecular 
Biology (Mayer and Walker, eds.), Academic Press, London, 1987; Handbook of 
Experimental Immunology, Volumes I-IV (Weir and Blackwell, eds.), 1986; and, 
Manipulating the Mouse Embryo Cold Spring Harbor Laboratory Press, 1986. 

Compounds of the Invention 

Specific, novel KIMs discovered through RDA or SSH techniques are set forth 
below in TABLE 1 and in the Sequence Listing. TABLE 1 lists KIMs for which potential 
full length (PFL) cDNA clones were obtained. Also shown are the predicted polypeptide 
sequences encoded in the open reading frames (ORFs) of many of the newly discovered 
KIMs. In addition, TABLE 1 provides, where relevant, the subtracted amplified clone 
(SAC) designations and the PFL location(s) of partial sequences, many of which were 
reported in prior U.S. Provisional S.N. 60/047,490 and 60/047,491. 
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SEQ. NO. 


PFL 


CLONE 
DESIGNATION 


OPEN READING FRAME (OFR) 
LOCATION SEQ. ID # 


# 3 


HW011 


HW011 






# 4 


HW012 


HW012 


39. . 671 


#5 


# 6 


HW013 


HW013 






# 7 


HW014 


HW014rev 


175. . 804 


#8 


# 9 


HW015 


HW15 


11 ..2176 


# 10 


# 12 


HW017 


SAC 24091 






# 13 


HW018 
HW101 


SAC_23880 


86 . . 1102 


# 14 


# 16 


HW033 


SAC 23901 






# 17 


HW034 


SAC_23897 


91 . . 837 
3101 . . 4162 


# 18 
#19 


# 20 


HW035 
HW112 


SAC_24468 






# 21 


HW036 


SAC_24406 






# 22 


HW037 

LIU/4 AA 

HW102 


SAC_24354 


280 . . 1422 




# 26 


HW040 


SAC 24520 


481 . . 2433 


#27 


# 28 


HW041 


SAC 24317 






# 29 


HW042 
HW113 


SAC_24017 


8 . . 889 


# 30 


# 31 


HW043 


SAC__24533 


8. .634 


# 32 


t? 33 


HW044 


SAC_24216 


24.. 1100 


# 34 


* 35 


HW045 


SAC_24028 


9 . . 1628 


# 36 


t? 37 


MW046 


A A 4 AAA 

SAC_24036 


9. .1160 


# 38 


* 39 


HW047 


SAC_23915 


8.. 511 


# 40 


ff 42 


HW050 

iii*/ ^ An 

liWiuy 


SAC_24644 


7 . . 1326 


# 43 




U\A/nCH 

nWOOl 


SAC_24170 


114.. 1505 


# 45 


# 46 


HW052 


SAC_24882 


247 . . 765 


# 47 




MW055 


SAC_24449 


214 . . 1329 


# 50 




UiA/nec 
nWUob 


SAC_24326 


8 . . 523 


# 52 


if M 

Tf OO 


nwuo/ 


SAC_23926 


7 . . 1023 


# 54 


if w 


nvvuoy 


©AC_24457 


9 . . 1271 


# 56 


if *T7 


nwuoi 


o a ^ n ^ AAA 

oAC_24029 


7 . . 1422 


# 58 


# 61 


nwuoy 


oAC_24477 


1360 . . 1893 


# 62 


# 64 


HW073 


SAC 24456 


9.. 1223 


# 65 


# 66 


HW074 


SAC_24464 


7. .300 


# 67 


# 66 


HW075 
HW117 


SAC_24466 






# 69 


HW076 


SAC_24409 






# 71 


HW080 


SAC_24033 






# 72 


HW082 


SAC_24469 


150 . . 569 


# 73 


# 75 


HW084 


SAC 24854 






# 76 


HW088 


SAC 24336 


9 . . 686 


# 77 


# 78 


HW089 


SAC 24461 


8 . . 1006 


# 79 
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TABLE 1, continued 



ceo no 


DPI 


f*l OKIE 

DESIGNATION 


OPEN READING FRAME (OFR) 
LOCATION SEQ. ID # 


# 80 


HW090 
HW118 


SAC_24197 


335 . 937 


# 81 


# 82 


HW092 


SAC_24320 






# 83 


HW093 


SAC_24538 


8. . 1357 


# 84 


# 85 


HW094 


SAC_23896 


255 . . 1238 


# 86 


# 87 


HW095 


SAC_23802 


7 . . 876 


# 88 


# 89 


HW096 


NONE 


295. . 1302 


# 90 



Additional KIMs, which are believed to be related to (e.g., substantially similar to) 
known molecules also were obtained through RDA or SSH techniques, and are set forth in 

5 TABLE 2 and in the Sequence Listing. It is believed that few to none of these molecules 
were previously appreciated to be KIMs, i.e., to have relevance to tissue injury or repair 
processes, particularly in the kidney. TABLE 2 lists these KIMs along with their clone 
designations. Partial sequences of some of the cDNA clones listed below were also set 
forth in the priority document. Also shown are the name(s) and GENBANK accession 

10 numbers of the known genes to which these KIMs may be related. 



TABLE 2 



SEQ1D# 


DESIGNATION 


RELATED GENE 


Accession # 


92 


23798 


Rat alpha 1A (1D) adrenergic 
receptor 


M60654 


93 


23800 


Rat N27 


V30789 


94 


23801 


Rat alpha-2mu globulin-related 
protein 


X13295 


95 


23833 


Rat contrapsin-like protease 
inhibitor related protein 


X16359 


96 


23836 


Rat Fit-1 


V04319 


97 


23853 


Rat tumor-associated antigen 
(PE4) 


L12025 


98 


23906 


Rat sulfated glycoprotein 2 


X13231 


99 


23907 


Mouse secreted protein (p85) 


L33416 


100 


23916 


Human Differentiation-dependent 
A4 protein 


L09604 
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SEQ1D# 


DESIGNATION 


RELATED GFNF 


ACC888IOn w 


101 


23922 


Rat manganese-containing 
superoxide dismutase 


Y00497 


102 


24030 


Rat heme oxygenase 


J02722 


103 


24078 


Rat tissue inhibitor of 
metailoprotein 
asel (TIMP1) 


U 16022 


104 


24105 


Rat alpha 2- macrogiobulin 


J02635 


105 


24140 


Rat Cathepsin D 


X54467 


106 


24142 


Rat lysozyme 


L12458 


107 


24192 


Mouse MAMA mRNA 


X67809 


108 


24296 


Rat transketotase 


U09256 


109 


24445 


Rat heat stable antigen CD24/ 
Elonqation factor 1 


Z11531 


110 


24527 


Cathepsin L 


S85184 


111 


24540 


Mouse endothelial monocyte- 
activatina polypeptide 1 


U41341 


112 


24623 


Rat spleen thymosin beta4 


K01334 


1 


HW010 


Human semaphorin (CD1 00) 


U60800 


11 


HW016 


Mouse ribonucleotide reductase 
M2 subunit 


M14223 


15 


HW032 


Human S100C 


D49355 


24 


HW038 


Human splicing factor SF3a120 


X85237 


25 


HW039 


Annexin 11 


M82802 


41 




Mouse u22 snoRNA host gene 
(UHG) 


U40654 


48 


HW054 


Rat myelin oligodendricyte protein 
(MOG) 


M99485 


59 


HW062 


Mouse proliferation-associated 
protein 1 


U43918 


60 


HW066 


Human cleavage stimulation 
factor 50kd subunit 


L02547 


63 


HW070 


Human BRCA2 region mRNA 
sequence CG037 


U50523 
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TABLE 2, continued 



SEQ1D# 


DESIGNATION 


RELATED GENE 


Accession # 


70 


HW078 


Human G protein gamma-10 
subunit 


P50151 


74 


HW083 


Rat ins-1 winqed hslix 


P97691 


91 


HW097 


Human Na+ channel 2 


U78181 



Any of the foregoing KIM cDNAs can be inserted into a vector, liposome or other 
carrier vehicle for internalization and production in a host cell. Furthermore, the invention 
encompasses derivatives and variants of each of the foregoing KJMs as listed in TABLES 1 
and! 

One embodiment of the invention provides soluble variants of a KIM protein that is 
usually synthesized as a membrane associated protein. Soluble variants lack at least a 
portion of the transmembrane or intra-membrane section of a native KIM protein. In some 
examples, the soluble variant lacks the entire transmembrane or intra-membrane section of 
a native KIM protein. Soluble variants include fusion proteins which encompass 
derivatives of KIM proteins that lack at least a portion of the transmembrane or intra- 
membrane section of a native KIM protein. All types of KIM fusion proteins are included, 
particularly those which incorporate his-tag, Ig-tag, and myc-tag forms of the molecule. 
These KIM fusions may have characteristics which are therapeutically advantageous, such 
as the increased half-life conferred by the Ig-tag. Also included are fusion proteins which 
incorporate portions of selected domains of the KIM protein. 

Variants can differ from a naturally occurring KIM protein in amino acid sequence 
or in ways that do not involve sequence, or both. Variants in amino acid sequence are 
produced when one or more amino acids in naturally occurring KIM protein is substituted 
with a different natural amino acid, an amino acid derivative or non-natural amino acid. 
Particularly preferred substitution variants include naturally occurring KIM proteins, or 
biologically active unique fragments thereof, whose sequences differ from the wild type 
sequence by one or more conservative amino acid substitutions, which typically have 
minimal influence on the secondary structure and hydrophobic nature of the protein or 
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peptide. Conservative substitutions typically include the substitution of one amino acid for 
another with similar characteristics, such as substitutions within the following groups: 
valine, glycine; glycine, alanine; valine, isoleucine; aspartic acid, glutamic acid; 
asparagine, glutamine; serine, threonine; lysine, arginine; and phenylalanine, tyrosine. 
5 Other conservative substitutions are those which meet the criteria for an "accepted point 
mutation" as defined in the Atlas of Protein Sequence and Structure (Dayhoff et al., eds.), 
1978. See also PCT publication no. WO97/44460. 

Substitution variants can also have sequences which differ by one or more non- 
conservative amino acid substitutions, provided however that the substitution does not 

10 abolish the native KIM protein's biological activity, ligand- or receptor-binding 

characteristics, or other functional property of interest. Exemplary non-conservative 
substitutions are those in which: (I) a hydrophilic residue, e.g., serine or threonine, is 
substituted for (or by) a hydrophobic residue, e.g., leucine, isoleucine, phenylalanine, or 
alanine; (ii) a cysteine residue is substituted for (or by) any other residue; (iii) a residue 

15 having an electropositive side chain, e.g., lysine, arginine or histidine, is substituted for (or 
by) a residue having an electronegative charge, e.g., glutamic acid or aspartic acid; or (iv) a 
residue having a bulky side chain, e.g., phenylalanine, is substituted for (or by) one not 
having such a side chain, e.g., glycine. 

Other KIM variants of this invention include truncation variants (comprising at 
20 least a unique fragment of the corresponding KIM), insertion variants, and splice variants. 

Other KIM variants within the invention are those with modifications which 
increase polypeptide stability. Such variants can contain, for example, one or more non- 
peptide bonds (which replace the peptide bonds) in the peptide sequence. Also included 
are: variants that include residues other than naturally occurring L-amino acids, such as D- 
25 amino acids or non-naturally occurring or synthetic amino acids such as beta or gamma 
amino acids and cyclic variants. Incorporation of D- instead of L-amino acids into the 
polypeptide may increase its resistance to proteases. See, e.g., U.S. Patent 5,219,990. 
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KIM variants can be naturally-occurring, or produced through synthetic or 
molecular engineering techniques. Those of skill in the art will understand and appreciate 
that an engineered KIM variant can provide advantageous properties, e.g., in facilitating 
purification, improving stability, modulating a biological function, or the like. Thus, in 
some instances, a KIM variant will be desired that lacks a glycosylation site, or that has 
decreased aggregation potential due to elimination of a hydrophobic surface, or that has 
improved folding efficiency due to elimination of a cysteine residue, or the like. 

In general, KIM variants have at least fifty (50) % amino acid sequence homology 
or similarity with the corresponding KIM protein. Preferably, the variants have at least 
sixty-five (65) % sequence similarity, more preferably at least eighty (80) % similarity with 
the corresponding KIM protein. Still more preferably, the sequence similarity is at least 
ninety (90) %, or most preferably, at least ninety-five (95) %. Other preferred KIM 
variants are those which 1) share at least forty (40) % similarity to the corresponding KIM 
protein, and 2) share at least eighty (80) % of aligned cysteine residues with the 
corresponding KIM protein. 

Just as it is possible to replace substituents of the amino acid backbone or scaffold, 
it is also possible to replace, modify or add non-amino acid moieties to the scaffold. Such 
moieties can occur naturally in a given KIM as a result of post-translational processing, 
including acetylation, methylation, phosphorylation, carboxylation or glycosylation. As 
desired, such moieties can be removed or added by conventional synthetic or biochemical 
techniques. Further, non-natural moieties can be added to produce a derivative of a KIM or 
KIM variant of this invention. For example, polyethylene glycol (PEG) can be linked to a 
KIM to improve its stability or pharmacokinetic properties. 

As will be readily appreciated, anti-KIM and anti-KIM variant antibodies can be 
produced by conventional techniques. Specifically contemplated are polyclonal and 
monoclonal antibodies, including antigen-binding fragments thereof and engineered 
derivatives thereof. Antigen-binding fragments of intact antibodies include complete Fab 
fragments, F(ab^ compounds, V H regions, and F v regions. Engineered derivatives of 
antibodies of the invention include single chain antibodies (see, e.g., WO 96/23071), as 
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well as human, humanized, primatized, or chimeric antibodies (see, e.g., PCT/US 
95/16709). Engineered derivatives of the present antibodies are produced, generally, using 
standard recombinant DNA techniques (Winter and Milstein, Nature 349: 293-99, 1991). 
These include "chimeric" antibodies, in which the antigen binding domain from an animal 

5 antibody is linked to a human constant domain. In this instance, an antibody with the 
desired KIM-binding selectivity is derived initially from a nonhuman mammal (e.g., a 
mouse, rat or hamster), and subjected to recombinant DNA manipulation to replace all or 
part of the hinge and constant regions of the heavy chain and/or the constant region of the 
light chain, with corresponding regions from a human immunoglobin light chain or heavy 

10 chain. (See, e.g., Cabilly et al., United States Pat. No. 4,816,567; Morrison et al., Proc. 
Natl. Acad. Sci. 81: 6851-55, 1984). Chimeric antibodies reduce the immunogenic 
responses elicited by animal antibodies when used in human clinical treatments. 

In addition, recombinant "humanized" antibodies can be synthesized. Humanized 
antibodies are antibodies initially derived from a nonhuman mammal in which recombinant 

15 DNA technology has been used to substitute some or all of the amino acids not required for 
antigen binding with amino acids from corresponding regions of a human immunoglobin 
light or heavy chain. That is, they are chimeras comprising mostly human immunoglobulin 
sequences into which the regions responsible for specific antigen-binding have been 
inserted (see, e.g., PCT patent application WO 94/04679). Animals are immunized with 

20 the desired antigen, the corresponding antibodies are isolated and the portion of the 
variable region sequences responsible for specific antigen binding are removed. The 
animal-derived antigen binding regions are then cloned into the appropriate position of the 
human antibody genes in which the antigen binding regions have been deleted. Humanized 
antibodies minimize the use of heterologous (inter-species) sequences in antibodies for use 

25 in human therapies, and are less likely to elicit unwanted immune responses. Primatized 
antibodies can be produced similarly. 

If desired, fully human antibodies with KIM-binding specificity which can be 
produced in nonhuman animals, such as transgenic animals harboring one or more human 
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immunoglobulin transgenes. Such animals may be used as a source for splenocytes for 
producing hybridomas, as is described in U.S. 5,569,825. 

Identification of Specific KIMs of the Invention 

To assess the dynamic biological processes of response to injury and tissue repair, a 

5 kidney ischemia-reperfusion model, which similates acute renal failure, was employed. 
The results of studies in this model system now have been published, and appear in 
Ichimura et al. (1998) . J. Biol. Chem. 273:4135-4142, the teachings of which are 
incorporated by reference herein. In this model, the kidney has the capacity for cell 
renewal (both structurally and funcitonally) after injury to tubular epithelial cells. The 

10 nephron is damaged functionally by an ischemic reperfusion injury that results in regional 
areas of proximal tubule cell death. During the repair process, the kidney proximal tubule 
epithelium undergoes a complex series of events including (1) cell death and cast 
formation in the tubule lumen (casts are aggregates of dead, semiviable and viable cells, as 
well as of cell debris); (2) proliferation of surviving proximal tubule epithelial cells; (3) 

15 formation of a poorly differentiated regenerative epithelium over the denuded basement 
membrane (this simplified epithelium expresses vimentin, a mesenchymal marker); and (4) 
differentiation of the regenerative epithelium to form fully functional proximal tubule 
epithelial cells. Gene expression analysis of the kidney at various timepoints following the 
onset of ischemia revealed the upregulation of many KIMs anticipated to be involved in the 

20 injury and repair/regeneration processes. 

1. Generation of ischemic and normal rat adult kidnevs 

Ischemic injured rat kidneys are generated as described by Witzgall et al. (J. Clin 
Invest. 93: 2175-2188, 1994). Briefly, the renal artery and vein from one kidney of an 
adult Sprague-Dawley rat are clamped for 40 minutes and then reperfused. Injured 
25 kidneys are harvested from the rats at 24 hours and at 48 hours after reperfusion. Kidneys 
from sham-operated, normal adult Sprague-Dawley rats are also harvested. 

2. mRNA isolation 
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Post-ischemic rat kidneys are prepared as described by Witzgall et al. (J. Clin. 
Invest 93: 2175-2188, 1994). Briefly, the renal artery and vein of the left kidney of an 
adult Sprague Dawley rat are clamped for 40 minutes and then reperfiised. The ischemic 
kidneys are removed from rats either 24 hours or 48 hours after reperfusion. Normal rat 
5 kidneys are used as controls. 

Total RNA is isolated with Trizol reagent (BRL catalog No. 1559-026). The kidney 
capsules are removed, then the kidneys are rinsed in PBS and immediately homogenized in 
Trizol solution. In some instances, the kidney is chopped into several pieces in PBS, frozen 
quickly in liquid nitrogen and kept at -70°C until it is processed for total RNA isolation. 
10 The mRNA is purified from total RNA according to the manufacturer's instructions, using 
an mRNA purification kit (Pharmacia, Catalog No. 27-9258-02). 

3(i). Representational difference analysis (RDA^ method to clone short-fragment cDNAs 
up-regulated bv ischemia 

The RDA method of Hubank and Schatz (Nucleic Acid Research 22:5640-48, 
15 1994) is performed as described, with modifications as follow. Double-stranded cDNA is 
synthesized using the Superscript Choice® system (BRL Catalog No. 18090), from mRNA 
isolated from rat kidney either 24 hours or 48 hours post-ischemia as a tester and normal 
rat kidney as a driver. The tester and driver cDNA is digested with DpnII and ligated to R- 
Bgl- 12/24 oligonucleotides. The adapter-ligated cDNA is amplified by PCR, and the PGR 
20 product is digested with DpnII to remove the oligonucleotides. The tester cDNA is ligated 
to J-Bgl- 12-24 oligonucleotides. 

Full length cDNA clones for Kim-1 and NMB contained in a plasmid vector are 
provided by T. Ichimura. They are digested with Sau3Al, purified and used as quenching 
reagents. cDNA fragments, 2-3 containing a fragment of annexin n, and 3-8 containing a 
;5 fragment of alpha 2 macroglobulin (both provided by T. Ichimura) are amplified by PCR 
with R-Bgl-24 primer, digested with DpnII to remove the oligonucleotides and used as 
quenching reagents. Tester cDNA is hybridized to an excess driver cDNA including the 
four quenching reagents described above. After mung bean nuclease digestion. 
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hybridization mixture is amplified by PCR to enrich the cDNA fragments upregulated by 
ischemia. Each of the PCR steps is tested with a range of template DNA concentrations, 
and the concentration giving the best signal is chosen for the next steps. The DNA mixture 
containing tester and driver DNA for the subtractive hybridization step is washed twice 
5 with 70% EtOH by spinning for 3 minutes. 

The second round cDN A subtraction is performed by hybridizing the first round 
cDNA subtraction product ligated to N-Bgl-12-24 oligonucleotides to an excess of the 
same driver cDNA used for the first round subtraction. The third round cDNA subtraction 
is performed by hybridizing the second round cDNA subtraction product ligated to J-Bgl- 
10 12/24 oligonucleotides to an excess of the same driver cDNA used for the first round of 
cDNA subtraction at a ratio of 1 : 14,000. The final PCR product after the third round of 
subtraction is digested with Dpnll to remove the oligonucleotides and ligated to pCR- 
Script SK(+) vector. This represents a selected cDNA library by RDA. 

3(H). Suppressio n subtractive hybridization (SSH) method to clone short-fragment cDNAs 
15 up-regulated bv ischemia 

SSH is perfomed according to the manufacturer's instructions (Clontech, Catalog 
No. K1804-1) with certain modifications. Briefly, double-stranded cDNA are synthesized 
from mRNA isolated from rat kidney 24 hours or 48 hours post-ischemia as a tester, and 
from nomal rat kidney as a driver with the Superscript Choice™ system for cDNA 

20 synthesis (BRL Catalog No. 18090). The cDNA is synthesized according to the 

manufacturer's instruction except that the first strand cDNA is synthesized at 42°C. cDNA 
is digested with RS AL Tester cDNA after RS AI digestion is ligated to adaptors. For the 
first hybridization, tester cDNA with adaptors is hybridized with excess driver cDNA at 
68°C for 12 hours. For the second hybridization, the mixture is hybridized at 68°C for 22 

25 hours. The cDNA mixture after the second hybridization is initially amplified by PCR for 
27 cycles only. Then, a portion of the PCR product is re-amplified for 10 or 13 cycles. The 
final subtracted PCR product is digested by RS AI to remove the adaptor, and separated on 
a gel. The three bands and the remaining smear are cut out separately and cloned to pCR- 
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Script SK (+) vector (Stratagene, Catalog No. 21 1 188). This represents a selected cDNA 
library by SSH. 

4. Isolation and C haracterization of Subtracted Amplified cDNA Clones 

Colonies are randomly picked from the selected RDA and SSH cDNA libraries, and 
5 plasmid DNA is isolated with Qiagen plasmid kit. Each cDNA clone, referred to as a 
Subtracted Amplified Clone (SAC) is sequenced with the vector primers. DNA sequences 
are checked against GeneBank/EMBL databases for homology by the program BLAST™. 

Southern blots are prepared with the initial PCR products generated from normal 
and injured rat kidneys. The blots are hybridized to the inserts isolated from the SACs to 
10 confirm which SACs are induced by ischemia. 

4. Isolation and C haracterization of potential full length cDNA clones 

4.25 ug of polyA+ mRNA isolated from rat kidney 24 hours post-ischemia, which 
has been purified twice by oligo dT chromatography, is used to generate a cDNA library. In 
order to obtain double-stranded cDNA with EcoRI adaptors, a Superscript Choice® 

15 system for cDNA synthesis (BRL Catalog No. 1 8090) is used following the supplier's 
protocol, except that the cDNA synthesis reaction is carried out at 42°C. The DNA is 
ethanol precipitated, washed, resuspended in 5 ul H20, and ligated to 2 ug EcoRI digested 
and CIAP-treated Lambda ZAPH (Strategene catalog No. 236612). The ligated DNA is 
packaged and used to infect E. coli XL-1 Blue MRF. The complexity of the library is 

20 3.4xl0 6 independent recombinant plaques. Another cDNA library is generated from 4 ug of 
polyA+ RNA from rat kidney 48 hours post-ischemia as described above. 

The two cDNA libraries described above are screened with inserts from the SAC 
clones. pBluescript plasmid vector containing cDNA inserts are excised from Lambda 
ZAPH vector by in vivo excision. Inserts from the longest cDNA clones, referred to as 
25 Potential Full Length (PFL) clones, are sequenced. DNA sequences are checked by the 
program BLAST™ against the GenBank/EMBL databases for DNA homology. Predicted 
protein coding regions (open reading frames, or ORFs) from some of the PFL clones also 
were used to search the GenBank/EMBL databases. 
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Diagnostic Uses of the Compounds of the Invention 

Anti-KIM antibodies of the invention, which specifically bind to a protein of the 
invention or a unique fragment thereof, are useful in several diagnostic methods. These 
agents can be labeled with detectable markers, such as fluoroscopically or radiographically 
5 opaque substances, and administered to an individual to allow imaging of tissues which 
express KIM protein. The agents can also be bound to (conjugated to) substances, such as 
horseradish peroxidase, which can be used as immunocytochemical stains to allow 
visualization of areas of KIM protein-positive cells on histological sections. A specific 
antibody could be used alone in this manner, and sites where it is bound can be visualized 
10 in a sandwich assay using an antiimmunoglobulin antibody which is itself bound to a 
detectable marker. 

Specific antibodies to KIM protein are also useful in immunoassays to measure 
KIM presence or concentration in samples of body tissues and fluids. Such concentrations 
may be correlated with different disease states. As an embodiment of particular interest, 

15 the invention includes a method of diagnosing renal injury, or of monitoring a process of 
renal repair, by measuring the concentration of KIM or of KIM fragments in the urine, 
plasma or serum of a patient. Similarly, KIM can be measured in urine sediment, in 
particular in cellular debris in the urine sediment. Casts of renal tubule cells, which may be 
present in urine sediment from patients with ongoing renal disease, may contain elevated 

20 levels of KIM protein and mRNA. 

Specific antibodies to KIM protein may also be bound to solid supports, such as 
beads or dishes, and used to remove the ligand from a solution, either for measurement, or 
for purification and characterization of the protein or its attributes (such as post- 
translational modifications). Such characterization of an individual's KIM protein is 
25 expected to be useful in identifying deleterious mutants or processing defects which 

interfere with KIM function and are associated with abnormal phenotypes. Each of these 
techniques is routine to those of skill in the immunological arts. 

Additional imaging methods utilize KIM or KIM fragments, fused to imageable 
moieties, for diagnostic imaging of tissues that express KIM ligands, particularly tumors. 
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Further diagnostic techniques are based on demonstration of upregulated KIM 
mRNA in tissues, as an indication of injury-related processes. (See, e.g., PCT publication 
no. WO 97/44460). 

Therapeutic Uses of the Compounds of the Invention 
5 The therapeutic methods of the invention involve selectively promoting or 

inhibiting cellular responses that are dependent on the presence or concentration of any 
KIM, including without limitation KIM ligation, KIM enzyme activity, KIM secretion, 
KIM signalling, and KIM modulation of gene expression. In instances where a KIM and a 
KIM ligand are both membrane bound, and expressed by different cells, the signal 
10 transduction may occur in the KIM-expressing cell, in the KIM ligand-expressing cell, or 
in both. 

KIM ligation-triggered response in a KIM ligand-expressing cell may be generated 
by contacting the cell with exogenous KIM, KIM fusion proteins or activating antibodies 
against KIM ligand, either in vitro or in vivo. Further, responses of the KIM ligand- 
15 expressing cell that would otherwise be triggered by endogenous KIM could be blocked by 
contacting the KIM ligand-expressing cell with a KIM ligand antagonist (e.g., an antagonist 
antibody that binds to KIM ligand), or by contacting the endogenous KIM with an anti- 
KIM antibody or other KM-binding molecule which prevents the effective ligation of KIM 
with a KIM ligand. 

20 Similarly, the responses triggered by KIM ligation in the KIM-expressing cell may 

be promoted or inhibited with exogenous compounds. For example, KIM ligation- 
triggered response in a KIM-expressing cell may be generated by contacting the cell with a 
soluble KIM ligand, or certain anti-KIM activating antibodies. Further, responses of the 
KIM-expressing cell that would otherwise be triggered by interaction with endogenous 

25 KIM ligand could be blocked by contacting the KIM-expressing cell with an antagonist to 
KIM (e.g.., a blocking antibody that binds to KIM in a manner that prevents effective, 
signal-generating KIM ligation), or by contacting the endogenous KIM ligand with an anti- 
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KIM ligand antibody or other KIM ligand-binding molecule which prevents the effective 
ligation of KIM with the KIM ligand. 

Which of the interventions described above are useful for particular therapeutic 
uses depend on the relevant etiologic mechanism of either the pathologic process to be 

5 inhibited, or of the medically desirable process to be promoted, as is apparent to those of 
skill in the medical arts. For example, where KIM ligation results in desirable cellular 
growth, maintenance of differentiated phenotype, resistance to apoptosis induced by 
various insults, or other medically advantageous responses, one of the above-described 
interventions that promote ligation-triggered response may be employed. In the alternative, 

10 one of the inhibitory interventions may be useful where KIM ligation invokes undesirable 
consequences, such as neoplastic growth, deleterious loss of cellular function, 
susceptibility to apoptosis, or promotion of inflammation events. 

Following are examples of the previously described therapeutic methods of the 
invention. One therapeutic use of the KIM-related compounds of the invention is for 

15 treating a subject with renal disease, promoting growth of new tissue in a subject, or 

promoting survival of damaged tissue in a subject, and includes the step of administering 
to the subject a therapeutically effective amount of a KIM protein of the invention, or of a 
pharmaceutical composition which includes a protein of the invention. The protein used in 
these methods can be a unique fragment of a full-length KIM protein, a soluble KIM 

20 variant or a soluble ligand thereof, a KIM fusion protein, or a KIM agonist. These 

methods can also be practiced by administering to the subject a therapeutically effective 
amount of an agonist antibody of the invention, or a pharmaceutical composition which 
includes an agonist antibody of the invention. A KIM protein can be administered 
concurrently with a therapeutically effective amount of a second compound which exerts a 

25 medically desirable adjunct effect. While tissues of interest for these methods include any 
tissue, preferred tissues include renal tissue, liver, neural tissue, heart, stomach, small 
intestine, spinal cord, or lung. Particular renal conditions which are expected to be 
beneficially treated with the compounds of the invention include acute renal failure, acute 
nephritis, chronic renal failure, nephrotic syndrome, renal tubule defects, kidney 
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transplants, toxic injury, hypoxic injury, and trauma. Renal tubule defects include those of 
either hereditary or acquired nature, such as polycystic renal disease, medullary cystic 
disease, and medullary sponge kidney. This list is not limited, and may include many other 
renal disorders (see, e.g., Harrison's Principles of Internal Medicine, 13th ed., 1994, which 
5 is herein incorporated by reference.) The subject of the methods may be human. 

A therapeutic intervention for inhibiting growth of undesirable, KIM- or KIM- 
ligand-expressing tissue in a subject includes the step of administering to the subject a 
therapeutically effective amount of a KIM antagonist (e.g.., an antibody blocks KIM 
ligation). In an embodiment of interest, the KIM antagonist or anti-KIM antibody can be 
10 used therapeutically to inhibit or block growth of tumors which depend on KIM protein for 
growth. 

Other methods of the invention include killing KIM ligand-expressing tumor cells, 
or inhibiting their growth, by contacting the cells with a fusion protein of a KIM and a 
toxin or radionuclide, or an anti-KIM ligand antibody conjugated to a toxin or radionuclide. 
15 The cell can be within a subject, and the protein or the conjugated antibody is administered 
to the subject. 

Also encompassed within the invention is a method for targeting a toxin or 
radionuclide to a cell expressing a KIM, comprising contacting the cell with a fusion 
protein comprising a KIM ligand and a toxin or radionuclide, or an anti-KIM antibody 
20 conjugated to a toxin or radionuclide. Another embodiment includes the method of 

suppressing growth of a tumor cell which expresses KIM, comprising contacting the cell 
with a fusion protein of KIM ligand and a toxin or radionuclide or with an anti-KIM 
antibody conjugated to a toxin or radionuclide; the cell may be within a subject, and the 
protein administered to the subject. 

25 The term "subject" as used herein is taken to mean any mammal to which KIM can 

be administered. Subjects specifically intended for treatment with the method of the 
invention include humans, as well as nonhuman primates, sheep, horses, cattle, goats, pigs, 
dogs, cats, rabbits, guinea pigs, hamsters, gerbils, rats and mice. 
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Use of Compounds of the Invention in Gene Therapy 

The KIM genes of the invention are introduced into damaged tissue, or into tissue 
where stimulated growth is desirable. Such gene therapy stimulates production of KIM 
protein by the transfected or transformed cells, promoting cell growth and/or survival of 
5 cells that express the KIM protein. 

In a specific embodiment of a gene therapy method, a gene coding for a KIM 
protein is be introduced into a renal target tissue. The KIM protein is expected to be stably 
expressed and stimulate tissue growth, division, or differentiation, or to potentiate cell 
survival. Furthermore, a KIM gene can be introduced into a target cell using a variety of 
10 well-known methods that use either viral or non-viral based strategies. 

Once introduced into a target cell, sequences of interest can be identified by 
conventional methods such as nucleic acid hybridization using probes comprising 
sequences that are homologous/complementary to the inserted gene sequences of the 
vector. In another approach, the sequence(s) may be identified by the presence or absence 
15 of a "marker" gene function (e.g, thymidine kinase activity, antibiotic resistance, and the 
like) caused by introduction of the expression vector into the target cell. 

Formulation 

In general, compounds of the invention are suspended, dissolved or dispersed in a 
pharmaceutically acceptable carrier or excipient. The resulting therapeutic composition 

20 does not adversely affect the subject's homeostasis, particularly electrolyte balance. Thus, 
an exemplary carrier comprises normal physiologic saline (0. 15M NaCl, pH 7.0 to 7.4). 
Other acceptable carriers are well known in the art and are described, for example, in 
Remington's Pharmaceut ical Sciences . Gennaro, ed., Mack Publishing Co., 1990. 
Acceptable carriers can include biocompatible, inert or bioabsorbable salts, buffering 

25 agents, oligo- or polysaccharides, polymers, viscosity-improving agents, preservatives, and 
the like. In some embodiments, the term "carrier" encompasses liposomes and the HTV-1 
tat protein (See Chen et al., Anal. Biochem. 227: 168-175, 1995) as well as any plasmid 
and viral expression vectors. 
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Any KIM compound of this invention can be used in the form of a pharmaceutical^ 
acceptable salt. Suitable acids and bases which are capable of forming salts with the 
polypeptides, nucleic acids and vectors of the present invention are well known to those of 
skill in the art, and include inorganic and organic acids and bases. 

5 KIM compounds of the invention are dispersed in the carrier to concentrations 

sufficient to deliver to the subject a therapeutically effective amount of the compound, 
which is an amount sufficient to produce a detectable, preferably medically beneficial 
effect in the subject. Medically beneficial effects would include preventing, delaying or 
attenuating deterioration of, or detectably improving, the subject's medical condition. It is 

10 expected that the concentration or amount of a KIM compound that will produce a 
medically beneficial effect will vary considerably with the circumstances in which the 
invention is practiced. An effective amount can be determined by an ordinarily skilled 
physician or other practitioner through no more than routine experimentation. As an 
example, an indication of the status of renal injury or renal function can be monitored with 

15 one or more routine laboratory tests which measure the concentrations of relevant 

substances in blood or urine, other urine characteristics, or the rate of clearance of various 
substances from the blood into the urine. The parameters measured by these tests, either 
individually or in combination, can be used by a physician to assess renal function or 
damage. Examples of such parameters include the blood concentration of urea, creatinine 

20 or protein; the urine concentration of protein or of various blood cells such as erythrocytes 
or leucocytes; urine specific gravity; amount of urine; the clearance rates of inulin, 
creatinine, urea or p-aminohippuric acid; and the presence of hypertension or edema. 

In some embodiments, a KIM compound is formulated in a liposome delivery 
system, including without limitation any of a variety of unilamellar vesicles, multilamellar 
25 vesicles, or stable plurilamellar vesicles, all of which can prepared and administered 
according to methods well known to those of skill in the art, for example in accordance 
with the teachings of United States Patent 5,169,637, 4,762,915, 5,000,958 or 5,185,154. 
In addition, it may be desirable to express the novel polypeptides of this invention, as well 
as other selected polypeptides, as lipoproteins, in order to enhance their binding to 
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liposomes. As an example, treatment of human acute renal failure with liposome- 
encapsulated KIM protein may be performed in vivo by introducing a KIM protein into 
cells in need of such treatment using liposomes. The liposomes can be delivered via 
catheter to the renal artery. The recombinant KIM protein is purified, for example, from 
5 CHO cells by immunoaffmity chromatography or any other convenient method, then mixed 
with liposomes and incorporated into them at high efficiency. The encapsulated protein 
may be tested in vitro for any effect on stimulating cell growth. 

Routes of Administration 

The compounds of the invention may be administered in any manner which is 

10 medically acceptable. Depending on the specific circumstances, local or systemic 

administration may be desirable. Preferably, the compound is administered via a parenteral 
route such as by an intravascular, intravenous, intraarterial, subcutaneous, intramuscular, 
intratumor, intraorbital, intraventricular, intraperitoneal, subcapsular, intracranial, 
intraspinal, or intranasal injection, infusion or inhalation. The compound also may be 

15 administered by implantation of an infusion pump, or a biocompatible or bioerodable 
sustained release implant, or by installation of a catheter (e.g., in a renal artery), into the 
subject. Alternatively, certain compounds of the invention, or formulations thereof, may 
be appropriate for oral or enteral administration. Still other compounds of the invention 
will be suitable for topical administration. 

20 Treatment Regimes 

Determining appropriate dosage and frequency of treatment with any particular 
KIM compound to be administered to an individual is within the skills and clinical 
judgement of ordinary practitioners. The general dosage and treatment schedule is 
established by preclinical and clinical trials, which involve extensive but routine studies to 

25 determine the optimal administration parameters of the compound. Even after such 
recommendations are made, the practitioner will often vary these dosages for different 
individuals based on a variety of considerations, such as the individual's age, medical 
status, weight, sex, and concurrent treatment with other pharmaceuticals. Determining the 
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optimal dosage and administration regime for each KIM compound is a routine matter for 
those of skill in the pharmaceutical and medical arts. 

Equivalents 

The invention may be embodied in other specific forms without departing from the 
5 spirit or essential characteristics thereof. The foregoing embodiments are therefore to be 
considered in all respects illustrative of, rather than limiting on, the invention disclosed 
herein. Scope of the invention thus is indicated by the appended claims rather than by the 
foregoing description, and all changes which come within the meaning and range of 
equivalency of the claims are intended to be embraced therein. 
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SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

(i) APPLICANT: Sanicola-Nadel , Michele 
Hession, Catherine A 
Wei , Henry 
Cate, Richard L 

(ii) TITLE OF INVENTION: MODULATORS OF TISSUE REGENERATION 

(iii) NUMBER OF SEQUENCES: 112 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Patent Administrator, Biogen, Inc. 

(B) STREET: 14 Cambridge Center 

(C) CITY: Cambridge 

(D) STATE: MA 

(E) COUNTRY: USA 

(F) ZIP: 02142 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: Patentln Release #1.0, Version #1.30 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 

(C) CLASSIFICATION: 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 60/047,490 

(B) FILING DATE: 23-MAY-1997 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 60/047,491 

(B) FILING DATE: 23-MAY-1997 

(viii) ATTORNEY /AGENT INFORMATION: 

(A) NAME: Fenton, Gillian M 

(B) REGISTRATION NUMBER: 36,508 

(C) REFERENCE/DOCKET NUMBER: A028PCT 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: (617) 679-2810 

(B) TELEFAX: (617) 679-2838 



(2) INFORMATION FOR SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2058 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
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(vii) IMMEDIATE SOURCE: 
(B) CLONE: HW010 

(ix) FEATURE: 

(A) NAME/ KEY: CDS 

(B) LOCATION: 8. .646 

(ix) FEATURE: 

(A) NAME/KEY: misc_f eature 

(B) LOCATION: complement (1618.. 1885) 
(D) OTHER INFORMATION: /label= SAC_23807 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 

GTCGACA GCT CGG ATC CCC CCA TCA CCT ACC TCA CAG ACT GCT CAG ACA 49 
Ala Arg He Pro Pro Ser Pro Thr Ser Gin Thr Ala Gin Thr 
1 5 io 

GAA GGT AGT AGG ATC ACA TCC AAA ATG CCT GTG GCG TCT ACC CAG GGG 97 
Glu Gly Ser Arg He Thr Ser Lys Met Pro Val Ala Ser Thr Gin Gly 
15 20 25 30 

TCC TCT CCC CCT ACC CCG GCT CTG TGG GCA ACC TCC CCC AGG GCT GCC 145 
Ser Ser Pro Pro Thr Pro Ala Leu Trp Ala Thr Ser Pro Arg Ala Ala 
35 40 45 

ACC CTA CCT CCC AAG TCC TCC TCC ACC GGC ACA TCC TGT GAA CCA AAA 193 
Thr Leu Pro Pro Lys Ser Ser Ser Thr Gly Thr Ser Cys Glu Pro Lys 
50 55 60 

ATG GTC ATC AAC ACG GTC CCA CAG CTC CAC TCG GAG AAG ACA GTG TAT 241 
Met Val He Asn Thr Val Pro Gin Leu His Ser Glu Lys Thr Val Tvr 
65 70 75 

CTC AAG TCC AGT GAC AAC CGC CTG CTC ATG TCT CTC CTC CTC TTC CTC 289 
Leu Lys Ser Ser Asp Asn Arg Leu Leu Met Ser Leu Leu Leu Phe Leu 
80 85 90 

TTT GTC CTC TTC CTC TGC CTC TTT TCC TAC AAC TGC TAC AAG GGC TAC 337 
Phe Val Leu Phe Leu Cys Leu Phe Ser Tyr Asn Cys Tyr Lys Gly Tyr 
95 100 105 no 

CTG CCC GGA CAG TGC TTA AAG TTC CGC TCA GCC CTG CTG CTC GCA AAG 385 
Leu Pro Gly Gin Cys Leu Lys Phe Arg Ser Ala Leu Leu Leu Ala Lys 
H5 120 125 

AAA AAA CCT AAG TCA GAG TTC TCT GAC CTG GAG CAG AGT GTG AAG GAG 433 
Lys Lys Pro Lys Ser Glu Phe Ser Asp Leu Glu Gin Ser Val Lys Glu 
!30 135 140 

ACG CTG GTA GAA CCT GGG AGC TTC TCG CAG CAG AAC GGC GAC CAG CCC 481 
Thr Leu Val Glu Pro Gly Ser Phe Ser Gin Gin Asn Gly Asp Gin Pro 
145 150 155 

AAG CCA GCC TTG GAT ACC GGC TAT GAA ACC GAG CAG GAC ACT ATC ACC 529 
Lys Pro Ala Leu Asp Thr Gly Tyr Glu Thr Glu Gin Asp Thr He Thr 
160 165 170 

AGC AAG GTC CCC ACC GAT CGA GAG GAC TCG CAA CGT ATC GAC GAG CTC 577 
Ser Lys Val Pro Thr Asp Arg Glu Asp Ser Gin Arg He Asp Glu Leu 
175 180 i 8 5 190 



-36- 



SUBSTITUTE SHEET (RULE 26) 



WO 98/53071 



PCT/US98/10547 



TCC GCC AGG GAC AAA CCG TTT GAT GTC AAG TGT GAA CTC AAG TTT GCA 625 
Ser Ala Arg Asp Lys Pro Phe Asp Val Lys Cys Glu Leu Lys Phe Ala 
195 200 205 

GAC TCG GAT GCC GAC GGG GAC TGAGGCCAGT GTGTCCCAGC CCGTGCCCCT 676 
Asp Ser Asp Ala Asp Gly Asp 
210 

CCGTCTTCGT GGAGAGTGTT GTGTTGAACC TATTCAGTAG CCGAGTCTTG TCACTGTGCA 736 

CCAGCCTCAG TCTTTTGTCC CTTTCTCTCT TGGGTTGAGC CTGTGACTTG TCCCCTTTGT 796 

CCTTTTGGGA AACAAGTATC TATTCAAGCC TCAAGTCCCG GCAGTTGTTG GAGCGCTTAC 856 

TACACACCTG AGCCCTTTGT GTCCTGGGGG AGAGATGGCC ACCTCCGTGG GCTGAGAAGA 916 

ACCACCCCTT CCTCTTCCGC TCCTCGAAGC AGCCACTGAG AGATAATTTA ATTCCAGATT 976 

GGAAATGACC TTTTTGGTTT ATCAGATTGG TAACTTAATC TCCTGCCATC CGGGTGGCAC 1036 

GGACGTTTTT CTTTCACTTC GTTATTTTTT TTAGGATTTG CGCTCCTACT GTGTTGATGT 1096 

CTTAGGTCAT TTTTTTTTTT TTAAGTTACC AGAGGAGATG TTTTGATATT CATGAGAAGA 1156 

AGAACATTTT CTAGATTTTT TTTTTTTTTG GTTACATATT GAGATAAAAT ATGCCTCTGT 1216 

TGCTTAAGAT TCTCAGGGAT AGACGTGATT TTTGCTAATT TCTTTCCTGC TGTTCGGAAC 1276 

GTAGGCCTAA AACTGTCTCT TGAGCTCACA CACTCACCCT TTCCTTTTGG TGGGTTTTTT 1336 

TTTTTTTTTG TTTTGTTTTA TTAATCTTAC TCATTTCGAA GGATTTTCTT TATGAGCTTT 1396 

TGTTTTGTTT TTGTTTTGTT TTTTTGTTTT TCTCTTTTTC CTGCCGTACA TCATCTACGA 1456 

GGTGCGTTTT GAGTGAGGGC AGATGGCCCA GTGGCTTCGG GTGGCCCAGT GGCTGTGGGT 1516 

GGCAGTTGAG CTGGTCCTGT GAGGGGAGGA GGGCTTAGAC GCCACGGCCC TGCTGCTTCT 1576 

CGGCACCTCC TGTCCCGATG GGTGGCGTCT TCATGAGGAT TACATCCTGT CTCTGCTGGC 1636 

TTCCATCTCG CACCTCTGCC CTTGGACTTC CACCTTGACT GTCCACAAAA GACAGAAATG 1696 

GGCTGGCTAG TTGGGCTCCC GGCCTCGGAC GGTGGCCGCA ACGTCCCGTG TGGGCGGCTG 1756 

CCGCTGCAGC CTGACTCCTG CCGGTGTCTT TCAGGATGTG AACGGGTGGT ACGAATCTTG 1816 

ACATTTGTTT CTCACTTCCT GTGTATGAAA CGACACTCAT TCCATGTAGA GGGTGACGGA 1876 

CTCTGGATCC CCCCCCTTGT CGTGTAGACA CTCATCTTCA GCGTGACCTG GTCCTGCCAT 1936 

TCGGTGTAAA CATTTGTGTT TATAAGATTT ACTTTGTTTT TATTTTTCTA CTTGGAACTG 1996 

TACATATTTG AAAAAGTACC CAAATAAACC AGAAGCTTTA TCGTTGAAAA AAAAAAGTCG 2056 

AC 2058 

(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 213 amino acids 

(B) TYPE: amino acid 
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(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Ala Arg lie Pro Pro Ser Pro Thr Ser Gin Thr Ala Gin Thr Glu Gly 
15 10 15 

Ser Arg lie Thr Ser Lys Met Pro Val Ala Ser Thr Gin Gly Ser Ser 
20 25 30 

Pro Pro Thr Pro Ala Leu Trp Ala Thr Ser Pro Arg Ala Ala Thr Leu 
35 40 45 

Pro Pro Lys Ser Ser Ser Thr Gly Thr Ser Cys Glu Pro Lys Met Val 
50 55 60 

lie Asn Thr Val Pro Gin Leu His Ser Glu Lys Thr Val Tyr Leu Lys 
65 70 75 80 

Ser Ser Asp Asn Arg Leu Leu Met Ser Leu Leu Leu Phe Leu Phe Val 
85 90 95 

Leu Phe Leu Cys Leu Phe Ser Tyr Asn Cys Tyr Lys Gly Tyr Leu Pro 
100 105 110 

Gly Gin Cys Leu Lys Phe Arg Ser Ala Leu Leu Leu Ala Lys Lys Lys 
115 120 125 

Pro Lys Ser Glu Phe Ser Asp Leu Glu Gin Ser Val Lys Glu Thr Leu 
130 135 140 

Val Glu Pro Gly Ser Phe Ser Gin Gin Asn Gly Asp Gin Pro Lys Pro 
145 150 155 160 

Ala Leu Asp Thr Gly Tyr Glu Thr Glu Gin Asp Thr lie Thr Ser Lys 
165 170 175 

Val Pro Thr Asp Arg Glu Asp Ser Gin Arg lie Asp Glu Leu Ser Ala 
180 185 190 

Arg Asp Lys Pro Phe Asp Val Lys Cys Glu Leu Lys Phe Ala Asp Ser 
195 200 205 

Asp Ala Asp Gly Asp 
210 

(2) INFORMATION FOR SEQ ID NO:3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1727 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 



(vii) IMMEDIATE SOURCE: 
<B) CLONE: HW011 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 



GTCGACATGA 


TTGCTAGTGG 


TTCTGCTGGC 


TGTGAGGGCT 


CCCTGTCTGT GAGATCTTGT 


60 


TCGGGCTTTC 


CACATCCAGG 


ATCCATCCAT 


TCACCCCTCA 


CCCACTCACC 


CTGTGGAGCA 


120 


TAAAACCTTT 


GCAGTCAGCC 


AGCATGGCAG 


AATGTCTATG 


CCTTGCTTCA 


GTTTCCAGGG 


180 


TAACTAAGCA 


TGTTTTAATC 


CCACTGCCCC 


ACCCACTATT 


GCTCATTCTG 


TCTAGTCTTC 


240 


TGCTACCCAG 


CTTGTGTGGC 


TCCGTTCCTG 


TCACGCCTTC 


AGACGTCACT 


GTTCTGTGAC 


300 


AGCGGTCTCT 


TTCCCCTCTG 


CTCCACCCAT 


TCTGAGCTCT 


GGGATAGTCC 


TAAGCCCTGC 


360 


CCAGAGTAGG 


TGGACTGGTC 


GTTCCATGTC 


CGTGCTTGTG 


GGAGTTGTGG 


GGGTCTGTGG 


420 


CATCTGCTAA 


GGAGGTTTAT 


CCTCACGTCA 


CGGAATCCGG 


CAAGATGCGC 


CATGGTCTAC 


460 


ACTGCTTTCT 


CGAGCAGCGG 


CGAGAAAGGC 


CAGGATAACA 


CACGCCTCTC 


TGCCCTGAGA 


540 


TAACAAACTG 


TCTGAGAAAG 


GAAGTGGCCC 


GTGTCTGGAA 


CATTGGCCCC 


AGTAAAGCAG 


600 


AGGAGACACT 


GCCATGTTTC 


CGTCATTCTG 


TTTCACTGCC 


TTGCTCTGGG 


GCTCAGGGGT 


660 


TGAAGAATGA 


GGATACACCA 


TTCGACCTTC 


AGGCCTTCCT 


CAGACCCTCT TACAAAGCCT 


720 


GTTGTTGTGA 


GTCCAGCTCA 


TCAGCATGGC 


CGCCTTGTTC 


ACAGTTTAGC 


ACCAGTGTCT 


780 


GCAACCGTGT 


CTGAAACGTA 


GCAGATGTCC 


AGTACTTTTT 


GTAAATGAAA AAATGAACAG 


840 


AGTGAAGAGA 


AAGAGTCACC 


ATGAAATTTA 


CTAGTCCAGT 


GAGCTGACCG 


TAGGGGCAGT 


900 


ATAAACATCC 


CCCAGCTCTT 


AATGTTCTGT 


CCTCAAGCTG 


TCCCCTCCCA 


CCTCTCCGCT 


960 


GGCAGGAGAG 


TGCTCTGTAT 


ATGTGGACAG 


TCGTTGAATG 


GCAGGTCCAG 


GATGTCCTCT 


1020 


GCAGGTGGAC 


AAAAGAGAAT 


TCTGCTTCCC 


GGAAGTCTCC 


TTTGCTGGTG 


ATCTCCATGT 


1080 


CCATCCTCAC 


TGAGCACTAG 


AATTAAAACC 


ACTCAATAAT 


GTGATATGTG 


TGGTCACCTT 


1140 


CAGTTCAGTA 


GCAGTATTTC 


CTGGTCATTA 


ATATTCTGCA 


TTTTATTATG 


GTAAGATAGT 


1200 


ATAAGAAAAT 


GGTGGGCTGG 


TATATAGCCA 


GCCATGAGAC 


CACTCCTTAC 


GTCAAGGACT 


1260 


GGCAGAGTCT 


GAGGGACCTG 


AGTCTGGGTC 


CTCCTCCTGT 


CCCCTCCTCC 


TGTAGCTATG 


1320 


GTGGCCTTCA 


GTATTCTGAG 


ACTGAAGGCT 


ACTACTTCAG 


GACACCTACT 


TGATCTTAGT 


1380 


ACTCTCAGAA 


CAGCTCTCCT 


CAGTTCTAGC 


TGTGATCGCG 


GGCTGCGTTT 


GGAACGTTAA 


1440 


CGTACTAATG 


GGTTTAAGAC 


TTTCAGAAAG 


TGATTTTAAA 


ACAAAATGAC 


TGGAAGTTTT 


1500 


GTCTGATAAA 


GCAAGTAGAA 


TGGAATGTCT 


CAGACTGTCT 


CGTCACATTT 


TCATGTTTCA 


1560 


TACGAAAAGA 


AAAGATTGAT 


AACCTATCTT 


TAGAAATAGT 


AATGTTAGTT 


CTTTCCTATT 


1620 


TTGTTTAAAC 


CTTGAAGAAG 


GTTTTATTTA 


ACTCGTGAAC 


TTGTTCCTTT 


GTTATGACAT 


1680 


GATTGTTCCA 


GATTTGAGGG 


ACAAAATAAA 


ATTAACCTTT 


CTAAAAA 




1727 
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(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1060 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 



(vii) IMMEDIATE SOURCE: 
(B) CLONE: HW012 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 39.. 671 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

GTCGACGCAT GGGTGCAGGT GACAAATGAG CTGAAGAA ATG ATC CCC AGA GGA 53 

Met lie Pro Arg Gly 
1 5 

ATA GGA TTG TGG CTG CCT TCA GCT CTG CTC CTC TCT CGG GTC CCA GGC 101 
lie Gly Leu Trp Leu Pro Ser Ala Leu Leu Leu Ser Arg Val Pro Gly 
10 15 20 

TGT TTC CCT CTG CGT GGC CCC AGC AGT GTG ACA GGC ACT GTG GGG GAG 149 
Cys Phe Pro Leu Arg Gly Pro Ser Ser Val Thr Gly Thr Val Gly Glu 
25 30 35 

TCC CTG AGC GTG ACT TGT CAG TAT GAG GAG AGA TTC AAG ATG AAT AAG 197 
Ser Leu Ser Val Thr Cys Gin Tyr Glu Glu Arg Phe Lys Met Asn Lys 
40 45 50 

AAA TAC TGG TGC AGA GGG TCA CAT GTT CTA CTT TGC AAA GAT ATT GTC 245 
Lys Tyr Trp Cys Arg Gly Ser His Val Leu Leu Cys Lys Asp He Val 
55 60 65 

AAG ACC GGA GGC TCA GAA GAA GCT AGA AAT GGC CGA GTG TCC ATC AGG 293 
Lys Thr Gly Gly Ser Glu Glu Ala Arg Asn Gly Arg Val Ser He Arg 
70 75 80 85 

GAT GAT CCA GAC AAC CTG ACC TTC ACA GTG ACC TTG GAG AGC CTC ATC 341 
Asp Asp Pro Asp Asn Leu Thr Phe Thr Val Thr Leu Glu Ser Leu He 
90 95 100 

CTG GAG GAT GCA GGC ACC TAC ATG TGT GGG GTG GAT ATA CCA TTT ACT 389 
Leu Glu Asp Ala Gly Thr Tyr Met Cys Gly Val Asp He Pro Phe Thr 
105 110 115 

AAT CAC CCC TTG GGG ATT GAT GAG TTC TTC AAG GTT GAA TTG TCT GTG 437 
Asn His Pro Leu Gly He Asp Glu Phe Phe Lys Val Glu Leu Ser Val 
120 125 130 

GTC CCA GGT TCA AGC CTT TGG AGC AGT ACA ACA GTC CCA GAG ACC ATC 485 
Val Pro Gly Ser Ser Leu Trp Ser Ser Thr Thr Val Pro Glu Thr He 
135 140 145 
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AGA TCC TCA CTG GTT CAT ACT CAG CCC AGT GTG ACC ACA GAA GAC AC A 533 
Arg Ser Ser Leu Val His Thr Gin Pro Ser Val Thr Thr Glu Asp Thr 
150 155 160 165 

ATT CCT GCT CCT AGT GCA CGG CCT CGG TCT CTT CTG GGC AGC CTC TAC 581 
lie Pro Ala Pro Ser Ala Arg Pro Arg Ser Leu Leu Gly Ser Leu Tyr 
170 175 180 

CTC TGC ATC CTG GTG TTT CTG GAG TTG CCC CTG TTC CTG TGC ATG CTC 629 
Leu Cys He Leu Val Phe Leu Glu Leu Pro Leu Phe Leu Cys Met Leu 
185 190 195 

TGT GCC GTC CTG TGG GTG AAC AGG CCT CAG AGG TGC AGT GGG 671 
Cys Ala Val Leu Trp Val Asn Arg Pro Gin Arg Cys Ser Gly 
200 205 210 



TGACATAGCA 


TTGGGCCCTG 


TCATGAGTAC 


CAGTGAAGTC 


TGTTGATATG 


GAGGCCCTGT 


731 


CCCTGGATGC 


ATCACCTCTG 


GTGGCCAAGG 


ACAACCAATA AAGCTGTGTT 


CTTGAGAATG 


791 


CTCTGAGACT 


TTTAGAAGAT 


TCTCTGCACG 


TGTCAAACAC 


ATAAAAGAAT 


GTGCCCAATA 


851 


AATACCAGCT 


GGGGAATTGG 


CTCAGCAGGT 


AACAACAATG 


GCTGCTTTTC 


CAGAGGTCCT 


911 


GAGTTCAATT 


CCAATAACTA 


CATCGGTGGC 


TCACAACCAT 


CTCTAATGAG 


TTCAGATGCC 


971 


CTCTTCTGCT 


GTGCAGGTGC 


ACATTCACAT 


AGAATATTCA 


TACATTGAGT AAATTTATTA 


1031 


AAAGTAATTG 


CAGAATTAAA 


ACACAAAAA 








1060 



(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 211 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

Met He Pro Arg Gly He Gly Leu Trp Leu Pro Ser Ala Leu Leu Leu 
15 10 15 

Ser Arg Val Pro Gly Cys Phe Pro Leu Arg Gly Pro Ser Ser Val Thr 
20 25 30 

Gly Thr Val Gly Glu Ser Leu Ser Val Thr Cys Gin Tyr Glu Glu Arg 
35 40 45 

Phe Lys Met Asn Lys Lys Tyr Trp Cys Arg Gly Ser His Val Leu Leu 
50 55 60 

Cys Lys Asp He Val Lys Thr Gly Gly Ser Glu Glu Ala Arg Asn Gly 
65 70 75 80 

Arg Val Ser He Arg Asp Asp Pro Asp Asn Leu Thr Phe Thr Val Thr 
85 90 95 

Leu Glu Ser Leu lie Leu Glu Asp Ala Gly Thr Tyr Met Cys Gly Val 
100 105 110 
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Asp He Pro Phe Thr Asn His Pro Leu Gly He Asp Glu Phe Phe Lys 
115 120 125 

Val Glu Leu Ser Val Val Pro Gly Ser Ser Leu Trp Ser Ser Thr Thr 
130 135 140 

Val Pro Glu Thr He Arg Ser Ser Leu Val His Thr Gin Pro Ser Val 
145 150 155 160 

Thr Thr Glu Asp Thr He Pro Ala Pro Ser Ala Arg Pro Arg Ser Leu 
165 170 175 

Leu Gly Ser Leu Tyr Leu Cys He Leu Val Phe Leu Glu Leu Pro Leu 
180 185 190 

Phe Leu Cys Met Leu Cys Ala Val Leu Trp Val Asn Arg Pro Gin Arg 
195 200 205 

Cys Ser Gly 
210 

(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3385 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 

(vii) IMMEDIATE SOURCE: 
(B) CLONE: HW013 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 



GTCGACTTTT 


TTTAACAAAG 


ATGATTTAAT 


ATTTAATATC 


ATACCCAGAA 


AAATCAAACA 


60 


ACTTCCAGAA 


TCAGACAGGT 


AAGATTCCTT 


AGTATGTTTT 


TACCAAAATT 


TTATAAGATT 


120 


ATCTTAAATT 


TGAAGAATTG 


TGCATAAATA 


ACACAGGATA 


CCAAAGGAAA 


CTGAGCTGTA 


180 


TTTTTTGCTT 


TGATGGGAAA 


AAAAAAAAAA 


AAAACCTGAA 


ACCCAGAGAC 


ACTAACCATT 


240 


TTGTGCACTG 


TATAACTCAT 


TGTGGAAGCT 


GAAACTTAGA 


ACTCATCTTT 


GGAAATATAT 


300 


AATTGATGAA 


CCCACCTTTC 


CATTCAGCTT 


TTAGGGTGAT 


GTAAAACTGT 


ATGAATCCAC 


360 


TTACAAATAA 


ACTTAAAAGA 


CCTTTTCATG 


TTACCCTATC 


TCTTATCTTC 


TCAATGTATT 


420 


TCTATATGCT 


TACCTTTAGT 


AAGAGTACCA 


TGGAGTACTT 


TGATAGATGG 


AATGACAGAG 


480 


AAATGGATGT 


TTAAAGATCT 


TTAGGATGAA 


AAGGATAATT 


TAAGAAGCAT 


ATTTAAGAAG 


540 


CATGCTGATA 


TTGAAAAGGT 


AGTATAGGCA 


AGTACAGTGC 


ACTTCCACAG 


ACTACTCATA 


600 


CCTTCTTGTC 


CATCAGAAAG 


TCTGAGTGGG 


TTAGTTACAG 


TAAGGTCATG 


TTGATCTGGG 


660 
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TCTATGCACC 


TGTGCTGAGG 


TCAGAGGACA 


ACTTTCAGGA 


GTCTGTTCTG 


CCCTTCCGTG 


720 


TGGGTCCTAA 


GTATGCAAC T 


CAAAGGTTGT 


CTGIXAGGTT 


GATTCTGCCC 


AGTCATGAAA 


780 


CCTTTGAGAA 


GAGGAAAAGG 


TGTAGGAAGA 


ACAGGAACAG 


GATCATAGAA 


GTGACTAAGC 


840 


CACjAATCTGT 


CATATTTGAG 


GAAACATCAA 


GAGGTGCCTA 


TAATAGGGTT 


CCGAGAATGA 


900 


AAAAAAAAAC 


CAGATAACTT 


TACTGTCTAT 


TCCAGCCAGT 


TTGACTTAGA 


GGCACTGGCA 


960 


AAGTAGGTAC 


AGACAGCTGA 


GGTAGAAAAT 


GTGGAACATT 


ACTGTGGCAA 


CTGCTGCATC 


1020 


TTCGGGTGTG 


GAAGTCACAC 


CAATTCCACT 


TTTGGTTGGC 


ACTTCCCATC 


TTCTAAGAGA 


1080 


GAACCTTTAA 


TTGAGCGGCC 


^>H%J^| 1 1 S*m 1 1 iiii 

GTGAAGAACT 


TGATAAATCT 


ACGCTGCTGT 


GTGTCTCTCA 


1140 


CCACCACCCT 


ACCTACATAG 


ACACCCCCAT 


TTAGTTGTTT 


CCTGAGAATC 


AGAACAGAGC 


1200 


ACTTACACGA 


GGTACGGGAA 


GCTGTGATGC 


CTAGTGTAAG 


AAGTTAGAAG 


ATGACTGAAC 


1260 


ACAGGAAGAG AGCGGGCTTC 


CAGTGGGATC 


TAAATCAAAT 


CAGCGAGGAT 


CACTGGAAAC 


1320 


TGAGCTGAAA CTCAACTAAG 


ATTTAGCTTG 


CAAACAATAG 


AGATGTTTCA 


TTTTAAAACC 


1380 


ACAGCAGATT AACAGTATAG 


CTGGGAAACA 


ACCAAGTGTT 


CTTTTCAAAT 


TTCAGTTCCA 


1440 


CTATATCTGA 


GCATGGTTAA 


AATAGGGGTG 


AGTCCTATTT 


CATGGCTTTT 


ATCTATTAAA 


1500 


TATTTGTAAA 


CGTGCTCAAG 


TTCCCATTTC 


TTTAACCCTA 


CAAAATTCAG 


GCTTCTCACA 


1560 


CTGGGGCTGT AGTTCTTTAA 


TAGTTTTCTA 


ATTCCTGAGC 


CATCTTTAGA 


CTTCTGAAAA 


1620 


CCTCAACTTC 


AGTCTCTTCT 


CACATAGCCA 


ACTATGAGAA 


CCTCCAAAGC 


AGAATGCAGT 


1680 


TTTCCTTGTG 


CCTTTAGCAA 


CAAGGCACCT 


TTACAAAACT 


GGTTCTCAGT 


GAATGAAATG 


1740 


AAAGCATGAT 


CTTACAGTGT 


GCGACACTTC 


AGTAGTGATG 


ACTTAGAATA 


GTTTTAAAAT 


1800 


AAATAAAATT 


ACAGAACCTA 


GGGGAGGTGA 


AAAATGGGAA 


ACATGAAACA 


GTAGACCATA 


1860 


CTGCAGCTCA 


CAAAACAAGC 


AGTGAAAACA 


TATAAACTAC 


CTATAATAGC 


CTCATTATAA 


1920 


CATGCAAACA CACTAACTCT 


GTTACATACA 


GATTGGATGT 


ATATCTGAAG 


TTGTTTATAG 


1980 


ATACAATGGG 


TTTAAGTTGG 


m i i mm i « m « •» 

TAATTAATAA 


ATAGTACTAC 


CTTAGAATGT 


GTTGTGCTTT 


2040 


TAAGTATTAT GAAAGCTCAA 


GAGCCAGGCA 


TCTTCCTCTT 


CAGAGCTGTT 


TCCTGTTACA 


2100 


TAATTACTGG 


CAAAGTTCTA 


AACAGTTCTT 


TACTAAATCA 


CAAAATGCAT 


ACAGCTAGTG 


2160 


CCATTGTCTC AAGTCTTTTT 


AVa AA 1 AG AAA 


TGTCGATGTG 


AAGCTATAAT 


TTTATTTTGG 


2220 


TTTTATGATC 


CTGCAAGGGT 


TAAGTTTGTC 


TGATTCTTCC 


CAGGGTACTG 


ATAAGTTCTC 


2280 


TAACATACCG 


GAGTAAGAGA 


ACTGAAACAG 


GTTATTTCCC 


TCATCCTCAG 


GAATCTGCTT 


2340 


GAATATGGAT 


CAGCATATAT 


TGGACTTAAT 


GCTGCTTTTG 


GGGGCCTAAT 


AGCAAACAGT 


2400 


CTATTTCGGC 


GAATTTTGAA 


CGTGACACAA 


GCTCGAGTAG 


CATCTAGCTT 


GCCAATGGCT 


2460 


GTGATCCCAT 


TTTTGACAGC 


AAACCTGTCC 


TACCATAGTT 


TTGTAAGTTT 


ACCTTTGAGT 


2520 


ACAGGTATGT 


TTTATTTCAT 


TATCATCCAA 


GCCTGCATTT 


ATGCATACTA 


TATACACTTA 


2580 
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TATATACTTC 


ACTCAAAACT 


GAAGGATTAG 


AACCGAGTAG 


GTGTGAGCTT TAATGAAGTG 


2640 


TTTAGGGAAG 


ACCAAATCTA 


TTCAACAACT 


GATTCAGCTT 


AGGTCTGCCA ACCAACCAGT 


2700 


GAGTACATAT 


CCACAGCCGT 


CAGTCACACA 


CTTCCCTGAC 


CTGAAGCAGC ATTCCCAGTG 


2760 


CTGCCTTTGT 


TTCTTGTTAA 


TTTGCACATT 


ATATGCAAGT 


TTTCTCTTTT TAAAAAGATG 


2820 


TTTTTATTTA 


GGTATGTGTT 


TTGTCTGACT 


GTACATTTGC 


ACACCAGGTA GTGGTACTGG 


2880 


ACCCCATATA 


CTAGTTACAG 


ATAAGGGTGA 


GCTGCCACGT 


GGGTGCTGGT AATTGAACTC 


2940 


AGCACTTAGT 


GCCATTGCTC 


TTAATTGAGA 


TCTCTCTCCA 


GCCCTGTATG AAAGTTCTTT 


3000 


AAAGTATAAG 


TATGTTGTGG 


GAATACAACT 


CTGTAATCTC 


TGAAGTACTA TGTGCATTGC 


3060 


AGCATTTAGC 


AAAATGAAAG 


ACATAGTGTC 

Owl X rVV3 X VJ X v*. 


CCATGTGTCC 


TGAACATGTA TTGTTATGTG 


3120 


TTCTATATGC 


ATGACCCTTC 


TTATTAAAAA 


CAGATTGGGG 


AAAGGGGTCA ATTTTGTGTT 


3180 


TTCTCCTAAA 


TAATAATCCA 


AATTTCTGAG 


AATCCAGAGT 


CAAGATTTGT TTTCCTGTTT 


3240 


ATCAGTATTT 


CCTATCCAAG 


GCCTTAATAC 


ATGCTAAGCA TGTATTCACA CCAAGCTACA 


3300 


GTCCATCCTC 


TATAATTATT 


CCTTTGACAT 


GATATGCACC 


ATTTTTGCTG TGTCACTGAA 


3360 


AATAAAATGT 


GAACCTTATG 


TCGAC 






3385 



(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1779 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 



(vii) IMMEDIATE SOURCE: 
(B) CLONE: HW014 

(ix) FEATURE: 

(A) NAME /KEY: CDS 

(B) LOCATION: 175.. 804 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

GTCGACCTGG AAAGAGCTGC AGTGTTCGCG CTTGGTAGCT GGTGGACCGG GCTCAGCACC 60 

TTCGGCTGGC TTTGTGTCCC AGAGGCTCAC CGGAAAAAGA CTTTCTCAGC CCTCGGACTC 120 

CAGAGAGAGA TACTTTTCGT GCTCCCCAAC CTTGACTCCT GCAGAAGCTG AGCG ATG 177 

Met 
1 

GCG TCC ATG GGG CTG CAG GTG CTG GGA ATC TCC TTG GCC GTC CTG GGC 225 
Ala Ser Met Gly Leu Gin Val Leu Gly lie Ser Leu Ala Val Leu Gly 
5 10 15 
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TGG TTA GGA GTC ATC CTG AGT TGT TCG CTC CCC ATG TGG CGG GTG ACC 273 
Trp Leu Gly Val lie Leu Ser Cys Ser Leu Pro Met Trp Arg Val Thr 
20 25 30 

GCC TTC ATC GGC AGC AAC ATC GTC ACG GCG CAG ACG AGC TGG GAA GGC 321 
Ala Phe lie Gly Ser Asn lie Val Thr Ala Gin Thr Ser Trp Glu Gly 
35 40 45 

CTG TGG ATG AAC TGC GTG GTG CAG AGC ACC GGC CAG ATG CAG TGC AAG 369 
Leu Trp Met Asn Cys Val Val Gin Ser Thr Gly Gin Met Gin Cys Lys 
50 55 60 65 

ATG TAC GAC TCG ATG CTG GCC CTG CCG CAG GAC CTG CAG GCC GCC CGA 417 
Met Tyr Asp Ser Met Leu Ala Leu Pro Gin Asp Leu Gin Ala Ala Arg 
70 75 80 

GCC CTG ATG GTC ATC AGC ATC ATC GTG GGT GCT CTG GGG ATG CTT CTC 465 
Ala Leu Met Val He Ser He He Val Gly Ala Leu Gly Met Leu Leu 
85 90 95 

TCA GTC GTA GGG GGC AAG TGC ACC AAC TGC ATG GAG GAC GAG ACC GTC 513 
Ser Val Val Gly Gly Lys Cys Thr Asn Cys Met Glu Asp Glu Thr Val 
100 105 110 

AAG GCC AAG GTC ATG ATC ACA GCC GGA GCC GTG TTC ATC GTG GCA AGC 561 
Lys Ala Lys Val Met He Thr Ala Gly Ala Val Phe He Val Ala Ser 
115 120 125 

ATG CTG ATT ATG GTG CCT GTG TCC TGG ACG GCA CAC AAC GTC ATC CGC 609 
Met Leu He Met Val Pro Val Ser Trp Thr Ala His Asn Val He Arg 
130 135 140 145 

GAC TTC TAC AAC CCT CTG GTG GCT TCC GGG CAG AAA AGG GAG ATG GGG 657 
Asp Phe Tyr Asn Pro Leu Val Ala Ser Gly Gin Lys Arg Glu Met Gly 
150 155 ~ ~ 160 

GCC TCG CTT TAC ATC GGC TGG GCG GCT TCT GGG CTG CTG CTC CTG GGA 705 
Ala Ser Leu Tyr He Gly Trp Ala Ala Ser Gly Leu Leu Leu Leu Gly 
165 170 175 

GGA GGC CTC CTC TGC TGC AAT TGC CCA CCT CGC CGC AAC GAA AAG CCC 753 
Gly Gly Leu Leu Cys Cys Asn Cys Pro Pro Arg Arg Asn Glu Lys Pro 
180 185 190 

TAC TCC GCC AAG TAC TCC GCC GCC CGC TCT GTT CCC GCC AGC AAC TAT 801 
Tyr Ser Ala Lys Tyr Ser Ala Ala Arg Ser Val Pro Ala Ser Asn Tyr 
195 200 205 

GTG TAAGGTGGGC CACTCTCTGT CCACATTGCC TTTATTTTCT TGGATTGAAC 854 

Val 

210 

TCATAACGGC CTGTGGCCCC TCACATTCTC CAGGACCTGA CCAGCTGTGG GCTACTGACT 914 

GCTTGCAAAC CCGGACTGTG CTAAGTTACT AGCGTGTAGC CCTTGGGGAC CCACCTGGCC 974 

CATCTGGACA CATCTCAAGG CTCCAGCGAG GATAGATGTA AAAATATTTC CTTGCTTGCA 1034 

TCCAGATTGC TCATGGATAC GGGGCTGAAG GCAGAAGCAG CTGTCTGGGT ACGACAGTGG 1094 

AGGGGGAGCT GGGTCCTGCT GGCCGGGATA GCTCAGCTGT GACTTTGGTC TCTGGAGTGG 1154 
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ATGTCCTGGT 


/•> « HV^fffFH TV f**C TV 

CATVjTTAGCA 


TA TV T* HIIIU » Ik /~im 

AACATTCACT 


GCCCTTTCTC 


AGTGCCCTCG 


CTCTCTCGCC 


1214 


I\-UALUT1 AC 




CTCTTGCCGT 


TTCTCGCCCG 


CGTTTCTGAG 


/"*TA O TV f>f* TV /""»^"*fTV 

CACACCAGGT 


1 »v*T 4 

1274 


CCTGCCTGGA 


GTCTTGGTGT 


CvjAGGATGAC 


mn TV omp IV X 

TGACTGAAGG 


GGCCTTTGAG 


AGCTGATGGG 


1334 


T\l CCATvj 


o Af" t *TVT , 'TY" , f"V' , 


/Vviflp TA IIUII TA ^ ^ 

UvjTGATTAGC 


TV TV fTV^ TV t VI V^^"»/" > » 

AATGACTGGG 


GCCTTACCCA 


CCCACCTACC 


1394 


rffl^^fnit TAFTtoiA 
CTCGTAATGA 


AQjFJXTGTGG 


TV ^MW^/ 1 IIIU IV 

AGTGGCTGGA 


CAGGTTTGAG 


GGAAGGGTGG 


AGGTGGTTTA 


1454 


AACTGGTTTG 




GGGCTGGGGA 


CCCAGAAGCA 


GCCCAGGGTG 


TCCCCACCCC 


1514 


TTTCCCATAC 


GGTCTTGCTA 


AATGTTCTGA 


TCTCTGTATA 


CCCCCTCCCT 


CTTCAGAAGG 


1574 


ACCCTGGGTG 


GGCCCCTCTG 


AATTCCCTAC 


CCTTGTCCAT 


TTCAAGGACG 


CTGGCCAGTC 


1634 


TGTGGAAGGT 


ACGGGGGTCT 


GATGGCATTG 


CACCAGGGAG 


CCTCCTGGAC 


TCCCTTGCCT 


1694 


TCTCTGTGGT 


TTCTTGTTTT 


GTAATTTAAG 


GTCTGTTCAC 


AGCTGTAATT ATTATTTTCT 


1754 


ACAATAAATG 


GCACCTGCAG 


TCGAC 








1779 



(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 210 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

Met Ala Ser Met Gly Leu Gin Val Leu Gly He Ser Leu Ala Val Leu 
15 10 15 

Gly Trp Leu Gly Val He Leu Ser Cys Ser Leu Pro Met Trp Arg Val 
20 25 30 

Thr Ala Phe He Gly Ser Asn He Val Thr Ala Gin Thr Ser Trp Glu 
35 40 45 

Gly Leu Trp Met Asn Cys Val Val Gin Ser Thr Gly Gin Met Gin Cys 
50 55 60 

Lys Met Tyr Asp Ser Met Leu Ala Leu Pro Gin Asp Leu Gin Ala Ala 
65 70 75 80 

Arg Ala Leu Met Val He Ser He He Val Gly Ala Leu Gly Met Leu 
85 90 95 

Leu Ser Val Val Gly Gly Lys Cys Thr Asn Cys Met Glu Asp Glu Thr 
100 105 110 

Val Lys Ala Lys Val Met He Thr Ala Gly Ala Val Phe He Val Ala 
115 120 125 

Ser Met Leu He Met Val Pro Val Ser Trp Thr Ala His Asn Val He 
130 135 140 

Arg Asp Phe Tyr Asn Pro Leu Val Ala Ser Gly Gin Lys Arg Glu Met 
145 150 155 160 



46 



SUBSTITUTE SHEET (RULE 26) 



WO 98/53071 



PCT/US98/10547 



Gly Ala Ser Leu Tyr lie Gly Trp Ala Ala Ser Gly Leu Leu Leu Leu 
165 170 175 

Gly Gly Gly Leu Leu Cys Cys Asn Cys Pro Pro Arg Arg Asn Glu Lys 
180 185 190 

Pro Tyr Ser Ala Lys Tyr Ser Ala Ala Arg Ser Val Pro Ala Ser Asn 
195 200 205 

Tyr Val 
210 

(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2537 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 



(vii) IMMEDIATE SOURCE: 
(B) CLONE: HW015 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 11. .2176 

(D) OTHER INFORMATION: /product= "predicted polypeptide" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

GTCGACCAAA ATG GCA GCC TTG GAA GAA GAA TTC ACG TTG TCT ACT GGA 49 
Met Ala Ala Leu Glu Glu Glu Phe Thr Leu Ser Thr Gly 
1 5 10 

GTC TTA GGC GCC GGG CCT GAA GGA TTC CTA GGT GTA GAG CCC AGT GAC 97 
Val Leu Gly Ala Gly Pro Glu Gly Phe Leu Gly Val Glu Pro Ser Asp 
15 20 25 

AAG GCC GAC CAG TTC CTA GTG ACC GAC AGA GGC AGG ACC GTC GTC CTC 145 
Lys Ala Asp Gin Phe Leu Val Thr Asp Arg Gly Arg Thr Val Val Leu 
30 35 40 45 

TAC AAG GTT TCT GAC CAG AAA CCC TTG GGC AGC TGG TCG GTG AAA CAA 193 
Tyr Lys Val Ser Asp Gin Lys Pro Leu Gly Ser Trp Ser Val Lys Gin 
50 55 " 60 

GGC CAG ACT ATA ACA TGT CCT GCT GTG TGC AAC TTT CAG ACT GGA GAG 241 
Gly Gin Thr He Thr Cys Pro Ala Val Cys Asn Phe Gin Thr Gly Glu 
65 70 75 

TAT ATT ATG GTA CAT GAC CAT AAG GTT TTG AGA ATA TGG AAT AAT GAA 289 
Tyr He Met Val His Asp His Lys Val Leu Arg He Trp Asn Asn Glu 
80 85 90 

GAC GTA AAC CTG GAT AAA GTA TTC AAA GCT ACA CTG TCA GCT GAG GTC 337 
Asp Val Asn Leu Asp Lys Val Phe Lys Ala Thr Leu Ser Ala Glu Val 
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95 100 105 

CAT AGG ATC CAC TCA GTA CAA AGA ACA GAA CCC CTG GTG CTG TTC CGA 385 
His Arg lie His Ser Val Gin Arg Thr Glu Pro Leu Val Leu Phe Arg 
110 115 120 125 

AGA GGC GCT GCT CGT GGG TTA GAG GCC TTG CTT GCA GAG CCA CAG CAG 433 
Arg Gly Ala Ala Arg Gly Leu Glu Ala Leu Leu Ala Glu Pro Gin Gin 
130 135 140 

AAC ATT GAA GCT GTC ATA CCA GAT GAG GAA GTG ATC GAG TGG TCA GAG 481 
Asn lie Glu Ala Val lie Pro Asp Glu Glu Val lie Glu Trp Ser Glu 
145 150 155 

GTT TTC ATG TTA TTT AAG CAA CCG GTT TTA ATT TTT ATT ACT GAA AAT 529 
Val Phe Met Leu Phe Lys Gin Pro Val Leu He Phe He Thr Glu Asn 
160 165 170 

CAT GGG AGT TAT GTT GCT TAT GTA CAA TTG TGC AAA TCA CAC AGC TTA 577 
His Gly Ser Tyr Val Ala Tyr Val Gin Leu Cys Lys Ser His Ser Leu 
175 180 185 

AGC AAA TAC ATA CTC TTA CTT GGA AAA GAA GAA AAA TCT GCT AAA CCA 625 
Ser Lys Tyr He Leu Leu Leu Gly Lys Glu Glu Lys Ser Ala Lys Pro 
190 195 200 205 

AAT TTT ACT GCA CGT GTG GAT GGG AAA TTC ATC TCC CTG GTG TCG CTA 673 
Asn Phe Thr Ala Arg Val Asp Gly Lys Phe He Ser Leu Val Ser Leu 
210 215 220 

GGC TCT GAT GGG TGT GTA TAT GGA ACC TTG GTA CCA ATA TAT TCA AGT 721 
Gly Ser Asp Gly Cys Val Tyr Gly Thr Leu Val Pro He Tyr Ser Ser 
225 230 235 

GAC ACG GAA AAC AAT CAG AGG TTA GTT AGA GCA TTG ATG CTC AAG ACG 769 
Asp Thr Glu Asn Asn Gin Arg Leu Val Arg Ala Leu Met Leu Lys Thr 
240 245 250 

GTT GTG TCT GGC AGT GCT CGA AAT GGT TCT GCC CTC ACC ATC CTG GAT 817 
Val Val Ser Gly Ser Ala Arg Asn Gly Ser Ala Leu Thr He Leu Asp 
255 260 265 

CAA GAC CAC ATA GCA GTC CTG GGA CCT CCA CTT CCA GCT TCT AAG GAA 865 
Gin Asp His He Ala Val Leu Gly Pro Pro Leu Pro Ala Ser Lys Glu 
270 275 280 285 

TGC CTC TCC ATA TGG AAC ATA AAA TTT CAG ACA TTA CAG ACA TCA AAA 913 
Cys Leu Ser He Trp Asn He Lys Phe Gin Thr Leu Gin Thr Ser Lys 
290 295 300 

GAG CTG CCA CAA GGA ACC AGT GGG CAA CTC TGG TAT CAT GGG GAA ATA 961 
Glu Leu Pro Gin Gly Thr Ser Gly Gin Leu Trp Tyr His Gly Glu He 
305 310 315 

CTA TTT ATG CGA CAT GGG AAA AGT CTA ACT GTG ATT CCA TAC AAG TGC 1009 
Leu Phe Met Arg His Gly Lys Ser Leu Thr Val He Pro Tyr Lys Cy S 
320 325 330 

GAA GCA TCA TCT CTG GCG AGC GCT CTT GGG AAG CTC AAG CAT ACT CAA 1057 
Glu Ala Ser Ser Leu Ala Ser Ala Leu Gly Lys Leu Lys His Thr Gin 
335 340 345 

GAG TCA GGC AGT CAT TCC GTG CCC CAT TTT GTA AAC TGG GAG ACG TGT 1105 
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Glu Ser Gly Ser His Ser Val Pro His Phe Val Asn Trp Glu Thr Cys 
350 355 360 365 

TCA GAA TAT GAA CTT GGG TCC TAC AGT GCA AAG CAG ACA AGA ACT CTT 1153 
Ser Glu Tyr Glu Leu Gly Ser Tyr Ser Ala Lys Gin Thr Arg Thr Leu 
370 375 380 

AGA AAA AAA ACT GAA ACA AAT TTA CAC CCA GAA GTT CCA GGA GTC AAA 1201 
Arg Lys Lys Thr Glu Thr Asn Leu His Pro Glu Val Pro Gly Val Lys 
385 390 395 

GAA CTT TTA TCA ATA ATA AAG AAA GAT TCA GAA AAG CAC ATT GAA GTA 1249 
Glu Leu Leu Ser lie lie Lys Lys Asp Ser Glu Lys His lie Glu Val 
400 405 410 

GAA CTA CGT AAG TTT TTG GCT AAG TCG ACA CCT GAC TTT CAT ACT ATA 1297 
Glu Leu Arg Lys Phe Leu Ala Lys Ser Thr Pro Asp Phe His Thr lie 
415 420 425 

ATT GGA GAC ATA GTG TCT GGA CTT CTG GGA AGA TGT AAA GTG GAA CCA 1345 
lie Gly Asp lie Val Ser Gly Leu Leu Gly Arg Cys Lys Val Glu Pro 
430 435 440 445 

TCG TTT TAC CCC CGG AAC TGT CTG ATG CAG CTC ATC CAG ACA CAC GTG 1393 
Ser Phe Tyr Pro Arg Asn Cys Leu Met Gin Leu lie Gin Thr His Val 
450 455 460 

CTT TCC TAC AGC TTA TGC CCT GAC TTG ATG GAG ATT GCC CTA GAG CAC 1441 
Leu Ser Tyr Ser Leu Cys Pro Asp Leu Met Glu lie Ala Leu Glu His 
465 470 475 

ACA GAT GTG CAG ATG TTA CAG CTG TGT CTG CAG CAG TTC CCT GAC ATT 1489 
Thr Asp Val Gin Met Leu Gin Leu Cys Leu Gin Gin Phe Pro Asp lie 
480 485 490 

CCT GAG TCC ACC ACC TGT GCT TGC TTA AAA CTT TTC CTG AGC ATT GGT 1537 
Pro Glu Ser Thr Thr Cys Ala Cys Leu Lys Leu Phe Leu Ser lie Gly 
495 500 505 

GAT GAC TGT CTT CGG GAC AGT AAT GTC AGC ATG GAG TCA GTT TTT GAT 1585 
Asp Asp Cys Leu Arg Asp Ser Asn Val Ser Met Glu Ser Val Phe Asp 
510 515 520 525 

TAT GGT GAT ACT ACA CAA GAT GAG AAA AAG GAA ATG GAA GAG CAA ACT 1633 
Tyr Gly Asp Thr Thr Gin Asp Glu Lys Lys Glu Met Glu Glu Gin Thr 
530 535 540 

GAA ATT GTT CAG AAT GGC TTC GGT CCT GAA GAT GGT GGC TGC AGT AAA 1681 
Glu lie Val Gin Asn Gly Phe Gly Pro Glu Asp Gly Gly Cys Ser Lys 
545 550 555 

GAT AGT CAG CAG TTG AAC AAA AAG CCT GGT GAC ACA GCA CAG GAG CCC 1729 
Asp Ser Gin Gin Leu Asn Lys Lys Pro Gly Asp Thr Ala Gin Glu Pro 
560 565 570 

ATC TCC TTC CCT GTG ACC TTG TGT CCT GTG GCA CCA AAG CGA GCA GCT 1777 
He Ser Phe Pro Val Thr Leu Cys Pro Val Ala Pro Lys Arg Ala Ala 
575 580 585 

CTG CTA AAT GCA GTC CTT CAT TCA GCA TAC AGT GAG CCC TTC CTC CTG 1825 
Leu Leu Asn Ala Val Leu His Ser Ala Tyr Ser Glu Pro Phe Leu Leu 
590 595 " 600 605 
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CCG CAC TTG AAG GAC ATC CCT GCG AAG CAT ATA ACG CTG TTT CTG CAG 1873 
Pro His Leu Lys Asp lie Pro Ala Lys His lie Thr Leu Phe Leu Gin 
610 615 620 

TAT TTG TAT TTC CTC TAT TTG AAG TGC ACT GGC AGT GCT ACC ATG ACT 1921 
Tyr Leu Tyr Phe Leu Tyr Leu Lys Cys Thr Gly Ser Ala Thr Met Thr 
625 630 635 

CTC CCT GGA GTA AGC CCT CCA ACC GTG AGC CAG ATC ATG GAT TGG ATA 1969 
Leu Pro Gly Val Ser Pro Pro Thr Val Ser Gin He Met Asp Trp He 
640 645 650 

TGC CTA CTT CTA GAT GCT AAT TTT ACT GTC GTC TTA ATG ATA CCA GAA 2017 
Cys Leu Leu Leu Asp Ala Asn Phe Thr Val Val Leu Met He Pro Glu 
655 660 665 

GCA AAA AGA CTT TTG CTT AAT CTT TAC AAT TTT GTG AAA TCG CAG ATC 2065 
Ala Lys Arg Leu Leu Leu Asn Leu Tyr Asn Phe Val Lys Ser Gin He 
670 675 680 685 

TCC ATC TAC TCC GAG CTC AAC AAG ATT GAA GTC AGC TTC CGG GAG CTG 2113 
Ser He Tyr Ser Glu Leu Asn Lys He Glu Val Ser Phe Arg Glu Leu 
690 695 700 

CAG AGG TTA AAT CAG GAG AAG AGC AGT AGA GGA CTG TAC TCC ATC GAA 2161 
Gin Arg Leu Asn Gin Glu Lys Ser Ser Arg Gly Leu Tyr Ser He Glu 
705 710 715 

GTT CTG GAA CTT TTC TGACAGCCAG AGACAGTCTT ACTGGTTGTC CTAGAACTGG 2216 
Val Leu Glu Leu Phe 
720 



ATATATAGAC CAAGCTGGCC 


TCAAACTCAC 


AGAGATCCAA 


CCCAGAGCAC 


TGCGATTAAA 


2276 


GGTGTGGCCA CCACACCTGG 


CTTAATGGAT 


TTTTTTATAT 


GAATGTACCT 


GTTCATCCAG 


2336 


GGGAGAAAGT TTTGTTTGTA 


TCCATCTCAG 


TGTCCAGAGA GGAGTGTGAG TCTCTGCACC 


2396 


ATCACATACC ACCATTCCTC 


AGTGCTACGG 


GTTCCACAGA 


CATCACTTCT 


GGGTTGCAGA 


2456 


CATCACTGGA AGAATGTTAC 


AAACTTTTTA 


AACATGGAAT 


TGAGTTGATT 


TTAAGTAAAC 


2516 


TTATTTGTGT ACTGATAAAA 


A 








2537 



(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 722 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

Met Ala Ala Leu Glu Glu Glu Phe Thr Leu Ser Thr Gly Val Leu Gly 
1 5 10 15 

Ala Gly Pro Glu Gly Phe Leu Gly Val Glu Pro Ser Asp Lys Ala Asp 
20 25 30 

Gin Phe Leu Val Thr Asp Arg Gly Arg Thr Val Val Leu Tyr Lys Val 
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35 40 45 

Ser Asp Gin Lys Pro Leu Gly Ser Trp Ser Val Lys Gin Gly Gin Thr 
50 55 60 

lie Thr Cys Pro Ala Val Cys Asn Phe Gin Thr Gly Glu Tyr lie Met 
65 70 75 80 

Val His Asp His Lys Val Leu Arg lie Trp Asn Asn Glu Asp Val Asn 
85 90 95 

Leu Asp Lys Val Phe Lys Ala Thr Leu Ser Ala Glu Val His Arg lie 
100 105 110 

His Ser Val Gin Arg Thr Glu Pro Leu Val Leu Phe Arg Arg Gly Ala 
115 120 125 

Ala Arg Gly Leu Glu Ala Leu Leu Ala Glu Pro Gin Gin Asn lie Glu 
130 135 140 

Ala Val He Pro Asp Glu Glu Val He Glu Trp Ser Glu Val Phe Met 
145 150 155 160 

Leu Phe Lys Gin Pro Val Leu He Phe He Thr Glu Asn His Gly Ser 
165 170 175 

Tyr Val Ala Tyr Val Gin Leu Cys Lys Ser His Ser Leu Ser Lys Tyr 
180 185 190 

He Leu Leu Leu Gly Lys Glu Glu Lys Ser Ala Lys Pro Asn Phe Thr 
195 200 205 

Ala Arg Val Asp Gly Lys Phe He Ser Leu Val Ser Leu Gly Ser Asp 
210 215 220 

Gly Cys Val Tyr Gly Thr Leu Val Pro He Tyr Ser Ser Asp Thr Glu 
225 230 235 240 

Asn Asn Gin Arg Leu Val Arg Ala Leu Met Leu Lys Thr Val Val Ser 
245 250 255 

Gly Ser Ala Arg Asn Gly Ser Ala Leu Thr He Leu Asp Gin Asp His 
260 265 270 

He Ala Val Leu Gly Pro Pro Leu Pro Ala Ser Lys Glu Cys Leu Ser 
275 280 285 

He Trp Asn He Lys Phe Gin Thr Leu Gin Thr Ser Lys Glu Leu Pro 
290 295 300 

Gin Gly Thr Ser Gly Gin Leu Trp Tyr His Gly Glu He Leu Phe Met 
305 310 315 320 

Arg His Gly Lys Ser Leu Thr Val He Pro Tyr Lys Cys Glu Ala Ser 
325 330 335 

Ser Leu Ala Ser Ala Leu Gly Lys Leu Lys His Thr Gin Glu Ser Gly 
340 345 350 

Ser His Ser Val Pro His Phe Val Asn Trp Glu Thr Cys Ser Glu Tyr 
355 360 365 

Glu Leu Gly Ser Tyr Ser Ala Lys Gin Thr Arg Thr Leu Arg Lys Lys 
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370 



375 



380 



Thr Glu Thr Asn Leu His Pro Glu Val Pro Gly Val Lys Glu Leu Leu 
385 390 395 400 

Ser lie lie Lys Lys Asp Ser Glu Lys His lie Glu Val Glu Leu Arg 
405 410 415 

Lys Phe Leu Ala Lys Ser Thr Pro Asp Phe His Thr lie lie Gly Asp 
420 425 430 

lie Val Ser Gly Leu Leu Gly Arg Cys Lys Val Glu Pro Ser Phe Tyr 
435 440 445 

Pro Arg Asn Cys Leu Met Gin Leu lie Gin Thr His Val Leu Ser Tyr 
450 455 460 

Ser Leu Cys Pro Asp Leu Met Glu lie Ala Leu Glu His Thr Asp Val 
465 470 475 480 

Gin Met Leu Gin Leu Cys Leu Gin Gin Phe Pro Asp lie Pro Glu Ser 
485 490 495 

Thr Thr Cys Ala Cys Leu Lys Leu Phe Leu Ser lie Gly Asp Asp Cys 
500 505 510 

Leu Arg Asp Ser Asn Val Ser Met Glu Ser Val Phe Asp Tyr Gly Asp 
515 520 525 

Thr Thr Gin Asp Glu Lys Lys Glu Met Glu Glu Gin Thr Glu lie Val 
530 535 540 

Gin Asn Gly Phe Gly Pro Glu Asp Gly Gly Cys Ser Lys Asp Ser Gin 
545 550 555 ~ 560 

Gin Leu Asn Lys Lys Pro Gly Asp Thr Ala Gin Glu Pro lie Ser Phe 
565 570 575 

Pro Val Thr Leu Cys Pro Val Ala Pro Lys Arg Ala Ala Leu Leu Asn 
580 585 590 

Ala Val Leu His Ser Ala Tyr Ser Glu Pro Phe Leu Leu Pro His Leu 
595 600 605 

Lys Asp lie Pro Ala Lys His lie Thr Leu Phe Leu Gin Tyr Leu Tyr 
610 615 620 

Phe Leu Tyr Leu Lys Cys Thr Gly Ser Ala Thr Met Thr Leu Pro Gly 
625 630 635 640 

Val Ser Pro Pro Thr Val Ser Gin lie Met Asp Trp lie Cys Leu Leu 
645 650 655 

Leu Asp Ala Asn Phe Thr Val Val Leu Met lie Pro Glu Ala Lys Arg 
660 665 670 

Leu Leu Leu Asn Leu Tyr Asn Phe Val Lys Ser Gin lie Ser lie Tyr 
675 680 685 

Ser Glu Leu Asn Lys lie Glu Val Ser Phe Arg Glu Leu Gin Arg Leu 
690 695 700 



Asn Gin Glu Lys Ser Ser Arg Gly Leu Tyr Ser lie Glu Val Leu Glu 
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705 710 715 720 

Leu Phe 

(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1305 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 

(vii) IMMEDIATE SOURCE: 
(B) CLONE: HW016 

(ix) FEATURE: 

(A) NAME/KEY: misc_f eature 

(B) LOCATION: 3 85.. 748 

(D) OTHER INFORMATION: /label* SAC_24530 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 



ATTATTCTCC 


CCTTGTCATT 


AAAAGAAATC 


AGCAAAACCA 


AACAACTGGC 


TACACCACGA 


60 


ATTGTCGTTA 


AATTTGCTAA 


CTGGTGTCTA 


AAAGCCGTGT 


AGCTACCTCG 


GTCCTGCTTG 


120 


CTAGGTTTGC 


CACTAGAAGG 


AAGCATACTT 


AAAACAATGG 


CTACTTGGAT 


CCTCAGGGAG 


180 


ATCCTGTCTG 


CAGTCGCGTG 


GTCACCCTTA 


GCTTCATCAA 


AGCACTAACA 


GCTCACCCGG 


240 


CCAGGCTTCA 


TGAGCACTGA 


CCCTCAAGCA 


AGCAGGTTTA 


TTAAACATTT 


AGATGCCAAC 


300 


CTCACTTACT 


GTTTCCTGCA 


GTCATGGAGA 


GTTTACTTAA 


CAAGTTTGTA 


AATAATAAAC 


360 


TGGCACTTTG 


CACACAGACT 


TGGTACTATC 


CTAGGGGAAG 


GCCTGCTTTA 


TTTGGTTTCT 


420 


AGACCGAGTA 


GGAAGTGATC 


CATTTACCAC 


TGAGGGCAGC 


CCCATTCAGA 


GTCTTAAGTG 


480 


ACTAAGCCAG 


TGTTGAACAA 


GCAATTTCCA 


GGCTTTGTTC 


TTCAGGGAAC 


TTCCCATCAG 


540 


CTTTGAAGTC 


GGTCCTGTGC 


ACCCTAGGCA 


CATGGATCAG 


TTCACAAGTG 


GGGTTCAGTG 


600 


GAGAGAACTT 


CCCCCTCAGA 


AGTCACTTGA 


AACTTAGATG 


AGATTTGGGA 


CACTTGCTGG 


660 


TTGACTCTGT 


CTCATTTGTG 


TAAAAAGTAG 


TTTNTNTNAA 


NANTNGGTTT 


TNTTTTTTTT 


720 


TTCAAGGTTA 


TACTTTGTCC 


CATTCCTANG 


TTANTACAAA 


GTCTTGAAAG 


GGCCTTTGTA 


780 


GGGCTTTTTA 


ANNCAGGGTC 


TTAACTATGT 


AACTCTGGCT 


TGGCCTGGAA 


CTTGCTATGT 


840 


AGACCAGGTT 


ACCCTCAAAC 


TTGCCTGTCT 


TCCCAAATAC 


TGGGATTAAG 


GTTTCTGTGA 


900 


CCATACCTGG 


CTTTACCTGA 


TGAATTCCTA 


AACACCAGAA 


AACCAGTACT 


GTATGAGATG 


960 
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TTAATGTGTG TTCCTTTCAG ACTGGAGTAC AGACCAGTAG ATAACAGATA ACAGCTGGTT 1020 

CACCTTAATC TGCCTTTTTG TGTATTAATC TGTGTTTAGA GAACGGAACA ATAGCCAGAA 1080 

TTNNNCTAGC GAGTTCGAGG CCAGTTGGTG TATATGTGGG ACTCTTAACC AAAACAGCAA 1140 

GCGTTCCCTG GGGTAGTTCA CAATGATCTC CAGCTTCCTT GTTAACCAGA TAACTGCNAN 1200 

TCCAGATGTA TGACCCTGGT TGGTTTATTG TATTGATATG TTTCTGTAAT ATGAGTAAAT 1260 

TATTGTTACT TAAAAGTAAT AAATTCAAGT TTCAAAATTG AAAAA 1305 
(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 955 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 



<vii) IMMEDIATE SOURCE: 
(B) CLONE: HW017 

(ix) FEATURE: 

(A) NAME/KEY: misc_f eature 

(B) LOCATION: 3 93.. 627 

(D) OTHER INFORMATION: /label= SAC_24091 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 



GTCGACAATC 


CTGTTATGAC 


AAGCCCGTCT 


CGATACTGAC 


AACTGAAGTG 


GAAATTGCTT 


60 


GAACATCAAG 


ATAGCTAACT 


CGGAAAGAAT 


CCCAGACTTT 


GGATAGTTTC 


TGAGTCTTCT 


120 


AGAATTTTCC 


AAGAAGAAAC 


CATGGTGGCA 


AATGCGGAGA 


CTCGGGCGGG 


CACTGGGAAC 


180 


ACAGCACAGT 


GGTCTTAGGG 


AGGTGCTTTG 


TCAGGAATGA 


ACAGTCATGG 


TTATAATCCA 


240 


CGTTTCCATT 


GCTACTCACG 


AATGATTCTC 


TTCTGTTTTG 


TTTTTAAAAT 


TTTTTTACAC 


300 


TGAATTTCTA 


TTTAGACACT 


AAAACATATA 


GGGGTGCTTG 


TCACCCGGAT 


ACATTTATCT 


360 


GTGAGCCAGC 


TATTCGGATG 


TCATGGCTGG 


GTACCTAACT 


TACTTCCATA 


TGTGAAGTGT 


420 


GCTAAACTCA 


AACCAGTTTA 


CAGAAACGAT 


GTATTTTGTG 


TATAGTAAAT 


TGTATATTCA 


480 


CCCTTTTACC 


ACGGCCGGTT 


TTTTAAACAA 


ATGAATACTC 


TAGATTTTTC 


TTCCAAATGA 


540 


GGTTACTGTT 


GGGGTGGGGT 


TGACTTAGTG 


ATGCTGTAGA 


AGGGAATCCG 


CATGCACTAA 


600 


AAAGTGTGTC 


TGCCTAAAAG 


TGTGTACAGC 


AGGCACAACC 


TTCCTTGTGG 


ATTTCCGTCT 


660 


GCTCACTGCA 


GATCTGCCTG 


TGGTTTAGAA 


ATAGAATTCA 


AGAGCCATCA AGGAGTGACC 


720 


AGCTTGACAC 


CACTGCCAAA 


TTCAGAATGA 


GGAACCTTGA 


GAGAAGGAAC 


TGTCGCTCAG 


780 


CCAGAAGCAG 


ATGAAGCCAC 


TGGTCCTGGT 


TCACAGTTTA 


GTGTCATGCT 


CTGGTTTATA 


840 
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AGGATGGGCA TTACGTTCGG TGTTATCTGG TTATCTGCGA GATTCTAAAC CGAGGTGCAA 900 
GAGCTCCTAT CCTGATTCTG AGATTTGAAA TAAAGCATGA ATGAACAAAG AAAAA 955 
(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1198 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 



(vii) IMMEDIATE SOURCE: 
(B) CLONE: HW018 

(ix) FEATURE: 

<A) NAME/ KEY: CDS 

(B) LOCATION: 86.. 1102 

(D) OTHER INFORMATION: /product= "predicted polypeptide" 

(ix) FEATURE: 

(A) NAME /KEY: misc_f eature 

(B) LOCATION: complement (644.. 994) 

(D) OTHER INFORMATION: /label= SAC_23880 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

GTCGACCTGG GAGTCCGGGC AGCGAGCCCC TCGCCGTCGG GTCCCACGAA GAAACTGGTG 60 

GCTGAAGCCA TATTCTGCAG CATTC ATG GAG ATC AGT AAA AAT ATT ACG AAC 112 

Met Glu lie Ser Lys Asn lie Thr Asn 
1 5 

CCA GAA GAA CTT TGG AAA ATG AAG CCT AAG GGG AAC CTG GAA GAT GAC 160 
Pro Glu Glu Leu Trp Lys Met Lys Pro Lys Gly Asn Leu Glu Asp Asp 
10 15 20 25 

AGT TAC TCG ACT AAG GAC ACC GGA GAG ACA AGC ATG CTG AAA AGA CCG 208 
Ser Tyr Ser Thr Lys Asp Thr Gly Glu Thr Ser Met Leu Lys Arg Pro 
30 35 40 

GTG CTC TCG CCT TTG CCG CAC ACG GTC CAC GTC GAC GCC TTC GAC TGC 256 
Val Leu Ser Pro Leu Pro His Thr Val His Val Asp Ala Phe Asp Cys 
45 50 55 

CCC ACG GAG CTC CAG CAC ACG CAG GAA CTC TTT CCA AAC TGG CAA TTG 304 
Pro Thr Glu Leu Gin His Thr Gin Glu Leu Phe Pro Asn Trp Gin Leu 
60 65 70 

CCA ATT AAA GTT GCC GCC GTC CTA TCG TCT CTC ACC TTC CTG TAC ACT 352 
Pro He Lys Val Ala Ala Val Leu Ser Ser Leu Thr Phe Leu Tyr Thr 
75 80 85 

CTT CTG AGG GAA ATC ATT TAC CCA TTA GTC GCT TCC CGC GAA CAG TAT 400 
Leu Leu Arg Glu He He Tyr Pro Leu Val Ala Ser Arg Glu Gin Tyr 
90 95 100 105 



-55- 



SUBSTITUTE SHEET (RULE 26) 



WO 98/53071 



PCT/US98/10547 



TTT TAT AAG ATC CCG ATC CTG GTC GTT AAC AAA GTC TTG CCA ATG GTC 448 
Phe Tyr Lys lie Pro He Leu Val Val Asn Lys Val Leu Pro Met Val 
110 115 120 

TCC ATT ACC CTC TTG GCA TTG GTT TAT CTG CCA GGA GAG ATA GCG GCA 496 
Ser He Thr Leu Leu Ala Leu Val Tyr Leu Pro Gly Glu He Ala Ala 
125 130 135 

GTT GTG CAG CTT CGC AAT GGG ACC AAG TAC AAG AAG TTC CCA CCC TGG 544 
Val Val Gin Leu Arg Asn Gly Thr Lys Tyr Lys Lys Phe Pro Pro Trp 
140 145 150 

TTA GAC AGA TGG ATG CTG GCG AGG AAA CAA TTT GGG CTC CTC AGC TTC 592 
Leu Asp Arg Trp Met Leu Ala Arg Lys Gin Phe Gly Leu Leu Ser Phe 
155 160 165 

TTC TTT GCA GTT CTG CAC GCC ATT TAC AGT CTC TCG TAC CCA ATG AGA 640 
Phe Phe Ala Val Leu His Ala He Tyr Ser Leu Ser Tyr Pro Met Arg 
170 175 180 185 

CGG TCC TAC AGA TAC AAG CTG CTC AAC TGG GCT TAC AAA CAG GTT CAA 688 
Arg Ser Tyr Arg Tyr Lys Leu Leu Asn Trp Ala Tyr Lys Gin Val Gin 
190 , 195 200 

CAA AGC AAA GAA GAT GCC TGG GTT GAG CAT GAT GTC TGG AGA ATG GAG 736 
Gin Ser Lys Glu Asp Ala Trp Val Glu His Asp Val Trp Arg Met Glu 
205 210 215 

ATT TAT GTG TCC CTG GGG ATT GTG GGA CTG GCC ATC TTG GCT CTC TTG 784 
He Tyr Val Ser Leu Gly He Val Gly Leu Ala He Leu Ala Leu Leu 
220 225 230 

GCT GTG ACA TCT ATC CCA TCT GTG AGC GAC TCT TTA ACC TGG AGA GAA 832 
Ala Val Thr Ser He Pro Ser Val Ser Asp Ser Leu Thr Trp Arg Glu 
235 240 245 

TTT CAC TAT ATT CAG AGC AAA CTG GGA ATT GTC TCT CTT CTT CTG GGC 880 
Phe His Tyr He Gin Ser Lys Leu Gly He Val Ser Leu Leu Leu Gly 
250 255 260 265 

ACG GTA CAC GCT TCG ATT TTT GCC TGG AAT AAA TGG GTA GAT ATC AGT 928 
Thr Val His Ala Ser He Phe Ala Trp Asn Lys Trp Val Asp He Ser 
270 275 280 

CAG TTT GTC TGG TAC ATG CCT CCG ACT TTC ATG ATA GCT GTT TTC CTT 976 
Gin Phe Val Trp Tyr Met Pro Pro Thr Phe Met He Ala Val Phe Leu 
285 290 295 

CCA ACT GTT GTT CTG ATC TGT AAA ATT GTA CTT TGC CTG CCC TGC CTG 1024 
Pro Thr Val Val Leu He Cys Lys He Val Leu Cys Leu Pro Cys Leu 
300 305 310 

AGG AAG AAG ATA CTG AAG ATT AGA TGT GGT TGG GAA GAT GTC CGC AAA 1072 
Arg Lys Lys He Leu Lys He Arg Cys Gly Trp Glu Asp Val Arg Lys 
315 320 325 

GTT AAC AGG ACT GAG ATG GCC TGC AGG TTG TAGAACTACT GTTTGCACAT 1122 
Val Asn Arg Thr Glu Met Ala Cys Arg Leu 
330 335 

AATTGCTTAA TACCGATGTC TTATAAACAT TTCAAACCTG TGTCTGTTAA TAAAATGAAT 1182 
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ACTTCTGGAT CAAAAA 1198 

(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 339 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

Met Glu He Ser Lys Asn He Thr Asn Pro Glu Glu Leu Trp Lys Met 
15 10 15 

Lys Pro Lys Gly Asn Leu Glu Asp Asp Ser Tyr Ser Thr Lys Asp Thr 
20 25 30 

Gly Glu Thr Ser Met Leu Lys Arg Pro Val Leu Ser Pro Leu Pro His 
35 40 45 

Thr Val His Val Asp Ala Phe Asp Cys Pro Thr Glu Leu Gin His Thr 
50 55 60 

Gin Glu Leu Phe Pro Asn Trp Gin Leu Pro He Lys Val Ala Ala Val 
65 70 75 80 

Leu Ser Ser Leu Thr Phe Leu Tyr Thr Leu Leu Arg Glu He He Tyr 
85 90 95 

Pro Leu Val Ala Ser Arg Glu Gin Tyr Phe Tyr Lys He Pro He Leu 
100 105 110 

Val Val Asn Lys Val Leu Pro Met Val Ser He Thr Leu Leu Ala Leu 
115 120 125 

Val Tyr Leu Pro Gly Glu He Ala Ala Val Val Gin Leu Arg Asn Gly 
130 135 140 

Thr Lys Tyr Lys Lys Phe Pro Pro Trp Leu Asp Arg Trp Met Leu Ala 
145 150 155 160 

Arg Lys Gin Phe Gly Leu Leu Ser Phe Phe Phe Ala Val Leu His Ala 
165 170 175 

He Tyr Ser Leu Ser Tyr Pro Met Arg Arg Ser Tyr Arg Tyr Lys Leu 
180 185 190 

Leu Asn Trp Ala Tyr Lys Gin Val Gin Gin Ser Lys Glu Asp Ala Trp 
195 200 205 

Val Glu His Asp Val Trp Arg Met Glu He Tyr Val Ser Leu Gly He 
210 215 220 

Val Gly Leu Ala He Leu Ala Leu Leu Ala Val Thr Ser He Pro Ser 
225 230 235 240 

Val Ser Asp Ser Leu Thr Trp Arg Glu Phe His Tyr He Gin Ser Lys 
245 250 255 

Leu Gly He Val Ser Leu Leu Leu Gly Thr Val His Ala Ser He Phe 
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260 265 270 

Ala Trp Asn Lys Trp Val Asp He Ser Gin Phe Val Trp Tyr Met Pro 
27 5 280 285 

Pro Thr Phe Met He Ala Val Phe Leu Pro Thr Val Val Leu He Cvs 
290 295 300 

Lys He Val Leu Cys Leu Pro Cys Leu Arg Lys Lys He Leu Lys He 
305 310 315 320 

Arg Cys Gly Trp Glu Asp Val Arg Lys Val Asn Arg Thr Glu Met Ala 
325 330 335 

Cys Arg Leu 

(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1801 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 

(vii) IMMEDIATE SOURCE: 
(B) CLONE: HW032 

(ix) FEATURE: 

<A) NAME/KEY: misc_feature 

(B) LOCATION: complement (1412.. 1793) 

(D) OTHER INFORMATION: /label= SAC_24831 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

TTTTTCCAGC TTTTTAAGTT GCATTTATTT TTGATAATCT GTACTGTAGC CAATCAATTC 60 

TATTCTCCAT TCTTGCTTAA TTGTATTAGC CATAAAGGAG TTCAGGTTCA TAGGACTCAG 120 

TTTCTCTATT AAACCGTCAT TGATTATGGA AATACTGTCT TAAGCACTAT GCAGATTAAC 180 

ACCATACTTT TCTTCCATTA CATCTTTCAA GACAGTCACT CACAACTCAG AAGACTGATA 240 

CAGCATTAGA TGAAAAAAAC AAGATTCACA GATATATGTA GTGAAACTCA AACTAGGAGT 300 

CTTAGAGATA AAAAAGATAA GTAGATATTT AAAATGTAGT CTGTTAAAGA TTTAATCGGT 360 

TTAAAGTATA AGATTAGAAA GACAAAGTTT CTATTGTTTT GAAAGTTTTA AGTAAAGAAA 420 

AAAAAACTGG AAAATATGTC TTTGTTGTTC TTTCTCCGTT CTTAATTACG AGCTATCCAA 480 

CAGAAACCCC TAGGTCATAG TTTATGTTTC TGTTCTCATC AATGTAACTA GAACACTGGA 540 

GAAAGAAAAA GGAAGACATA TTAACCAATG GCCGCAATCA CTTTTCACTT GCCCCACGGT 600 

ACAAATCTGA TTTGGATGAA ATGGTTTGTG GCATAAAAAT TTTAAATTTT AGAAGCCAAT 660 
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vj x u\7 1 /iuxi X i 


try? ••i'^o & fpi'Miiiii 

IV X V-Wfi X VJ X X 


i\iLALAAATC CAATTTCAAC T AC TTTT ACT TCACTAATTA 


720 


ATTAAATflfiA 


TTflf: A A A P A T 


IV A AbAAATb AAATTGTGTC TTTAGATGGA GTTGGGGAGA 


780 


VJ X A^V3U X 1 X #1 


XXX XiinnX/iX 


TTAGTTCAAA TTTCATTTTC ACTTCTTGGT CTTATTATTT 


840 


CAGPTTPAGT 

v/»w\» X X X 




x xv i w x\jV9wj L. T x Av- A l\iCT GACTGTGTCA GGCAGCTGGA 


900 


GnTHATPTAP 


A TTYI A P A fifi A 
/\X luALAUUn 


X X\-AA x\xAC T TTCTCCTCCG CGAAAAGCCC CTGTCAGCTA 


960 


PAPAAfifiA r PR 


uvanvaul X X x\* 


WaUvUVLTGT CAAGTGATTG CAGATCCTGA CTTCTCAAAT 


1020 


ATGAGOTflTP 




ijUiW-AW.IA uxXjwTAQjCTG CCCCAGACTC CTCAGATAAC 


1080 


TTAPAPAGGA 


VJN— X VJO X X 1A1 


wjiivuLIl UoxvAvsAGCT TCCTCTGGGC AAGTGAAAGT 


1140 


TGPAARPPTT 
x uwtnuuu x x 




UAITjIvTCAG CTCTCTAAGT GACCCTGCCT GTGCTGGCCC 


1200 


X XUluv^l InX 


ulj X w low 1 vi 


olXaGTGGTGC AAGCCTGAAG GAGGAAGACT TGTCTCAACA 


1260 


TPTl ATY2 PPP 


PTYSPTWS/^TV* 
V- XAjrv* X IviL lu 


ATTCTCTGAA GGCTTACTGC TCCCAGCAAC CGTCAGCCTC 


1320 


AfSTPfiPAnpp 


lv.uL X 


CCTCAACTTG GCAAAAATGC CTACAGAGAC TGAGAGATGC 


1380 


X VUAU X\»l* t» 


ivAI iul xVj 1 


TTTCCAGAAG TACAGTGGGA AGGATGGAAA TAGCTGTCAT 


1440 


rTTTTPl AAA 


L. ibAb 1 I v_v- x 


TTCCTTCATG AACACGGAGC TGGCCGCCTT CACGAAGAAC 


1500 


CAGAAGGACC 


PPGT3TGTPPT 


P/2APP^2^ ,, &'Prt R flV * fi » 7V C^r* mii/<% m j-. » ninj . - j— .... . 

UVjAC tbL A XVji A X\jAAoAAG(J TGGACCTCAA C AGTGATGGG 


1560 


CAGCTAGATT 


TCCAAGAGTT 


TCTCAACCTT ATTGGTGGCT TAGCTATAGC ATGCCATGAG 


1620 


TCCTTCCTCC 


AGACTTCCCA 


GAAGCGTATC TAACCCTCTC CATTCCCTTC CAGCCACCAA 


1680 


GTCATCGCCT 


CCTCCACTCC 


TTCCCCCATC CACACCTGCA CTGAGCCCAC CACACCTACC 


1740 


ACACATGCAG 


CCCACGCCTG 


ACAGGGAAAA TAAAACAATG TCATTTTTTT AAATGTAAAA 


1800 


A 






1801 



(2) INFORMATION FOR SEQ ID NO: 16: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2646 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 



(vii) IMMEDIATE SOURCE: 
(B) CLONE: HW033 

(ix) FEATURE: 

(A) NAME/KEY: misc_f eature 

(B) LOCATION: 50.. 263 

(D) OTHER INFORMATION: /label= SAC_23901 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 
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GTCGACTTGA CTTCTTAGTT TTCTGACGGG AGCTCTCTTC CTTCTGAGTA CTAGTCTCGG 60 

GCCTCCAGCC CCAGGAAGAC CCTCTGCTTT TCTCTCCACC CTTGGCTTGC TCCTCAGCAG 120 

GGAAGAAGTA GATCACAGTC TCACTGCCTT CAGTTCCCAG CTTTGCCAGT CACCAGCTGT 180 

GTAGTCTCAG GCCAATCACC ACCTCCTCTA GCCTAGGTAG TTGGTTTTAT GTGGGTGAAT 240 

GTTTTGCCTG GATGTATGTA TGTACACCAC CTGTGTGCCT GGTGCCCCTG CAGTCAGAAG 300 

AAAGTGCCAC ACCCCTAGGA ACTGGAGTTA AGATGGGTTA GAGCCTCCAT GTGGGTGCTG 360 

AGAACTGAAC CTGGTTCTCT GGAAAAGCAG CCGGCACTCA ACCTCTGAGC CAGCTCTTCA 420 

TCCCCGAGTC CGTTTATTTA ACCGGCTATA AAATGGATTT AAAACGACTT ACCTCACCAG 480 

GCCTGTGAAA CATTTAAAAA CTTAATAAAA ATTCTACACT TAATTGTTTT TGCTTATGCT 540 

TTCTAGTCCC TTTAGTCTTA GACAGTTTCT GTTTCTTGTA TTTATAAAGT TGGTATTAAA 600 

TATATAGCCA GTGGACAAGA ACCTTACTGC TATGGGCTGG CCCCTGTAGG GTCCTAGACT 660 

AGAGGAGCGT ACTTTGTGGT TTAAAGTGGT TTGGCCTGCC TACCTGTTTT TCCCACTGGA 720 

CTCAAAACAC TAATAAAGCA GGGGTGACCT GTGTCACCTG AGCATTCAGC AGAACACTGG 780 

CCCAGGCAAT GGCCTCAAAG CTGGGCTACA TTATAAAGTG AACTTTCTTG TTCATACTGG 840 

GTCACAGAAG CAGGCCCAGA AAAAATAAAA AATCCTGGTC ACTCGGTTCC CCCTTGCACC 900 

ATCATCTGTT CACTAGTGCC CCTCTGACCA TTCTCACCCA AGACAGGCAT CCTCTGGGAA 960 

CCTTAGGCAG GTCCATTTCT TCTTTGTCAT TTGGAATGAA GCACTAACAG CCCTGGGGTC 1020 

TCTGCCATTG GCTTCCCAGA GTTGTCCAAT GAGTTTTCGT GAAGGAATCT GGGTTTATCT 1080 

TTCCCGGGCT TCCTGCCCTG TTAGCAGAAT TTCAAATAGC AGATAATAGA TTCTGACCTG 1140 

TGAGGCCATT TTTCTCTCAA TCCCTTTCTT CTGAGACATT AGCCTTGGGC ATGCAGCAGC 1200 

TGCGCACTGG TTTCAGGGAG GAAAGGGGTG AATGGTGAGC GAACTTGGCC CTGTGGATGC 1260 

TAACTCCACT GAGATGAAAA CATGGGGGCT TTTCTCTGAA GACGCAGGGC CGTGTAAACA 1320 

TATGACTTTG ACCATATTCT CAGGTTTGGG GATGTGGGAA GTTCCAGGAA CAGAAATCCC 1380 

CTGCTGAGTT CTTGGGCACT GTGGCTATTT GTTCCCATGA GGACCTCTCC CTGCAGTCTG 1440 

CCCTCTGGCA GGGTGGAGTA TAGAAGACTC AAGAGCTACA GATGCCCCCG TCACTGGAGC 1500 

TCCCCTGCAC CTAGGCCATC AATCACGTCC ACCATCACCA GAGGGGCTTA GGCCTAGGGT 1560 

GTATGCTGAG GGAAAGACGA GGGCCTGTCC CACTACTCGA GTAGAGCAGA TGGCCTTCTG 1620 

TGTAGGCAGT GAATTTCATA GCAGTGCCTT GAGGAGCCCA GCATGTTCTC AAGCACCTCC 1680 

TCCCAGTTTA TATTTGGAAG TCTGCTTCAC AGGGACCAGA GGGCATAAAC CTAACCTACC 1740 

CACTTCCCAT CAGGGACAGC CCAAAGCACA CCAGTATATG TAAGGGGTGC AGGGAGGTAT 1800 

CATGGGTCTC TTCTCTCCTC TCAGTGCTCA AACTCAGGTG GAGGCTATGG ATTCAGGATC 1860 
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TTGGTTAGGA 
a x uvj x x nuuA 


vj l 1 \jV» AAvjv^A 


TV ITV^O TV /"» TV TV O TV 

A I\jbAC AAG A 


AAGAGCATTA 


GGG AAACCAG 


AATCCCAGCC 


1920 




V-v- X^V^rtV^V^V..^, 


1 A w AL 1 1 


vj AVjGC xT AUG 


TCCTGATACT 


CTGCCCTTAC 


1980 




ivwil X x\-L.v_ 


AWjAbi lAbT 


TV TV TV TV TV TV /"^ TV f 

aaaaaacagg 


CAAGGGTAAT 


ATGCTATCAT 


2040 


CTGTGGTrCT 




CAVjAU 1 1 LLL 


CTTCACTCCG 


CCGCCACTGC 


CACCACCACA 


2100 


GAGACATCTT 


PGR A A fST" 2k TT 


Avivsi 1 lA-FTx 


CTTGAACTGA 


ACTCCTGGGG 


AGAAGATGAC 


2160 


TGGTGCCACT 

* ww •! UV*wAw\. 


nUi X X \jL> 11— 


i\j iU l\ivil TA 


GGGGCCTTGA 


GGTGGCACAG 


TCCAGCCATA 


2220 


GAAGGATCAG 




TV rn TV /"* TV /"* IK mm 

LA 1 AC AUATT 


AATGCCAGGG 


AGTCAGATAG 


TTGATGCGAC 


2280 


lui rlruio nun 


AAVjr AviAVitj 1 \J 


AGATGGGGAC 


TCCAAGGTAC 


TGTAAAGAGA 


AATCAAGTTT 


2340 


'|wn/ '#iwn/"v* f*Ti JV 
1 Iv. 1 1LLL1A 


(jC rCCATCCT 


CCACTCCAAA 


GCCTCAGGGA TCTGAAAACA GTCTTCATCC 


2400 


AGTCTATGTG 


CCCACAACCT 


TCAATGGTTC 


CCTACTTCCT CTTGTATATT TTTGGCTATT 


2460 


CTCTTCACCT 


CTAATTAGCT 


CATACTGTTT 


CCCACTGACT CCCGTCTTTA ATGGCTTCAT 


2520 


AGGTTTTCAG 


TCACGTATGA 


ATCTACACTG 


CATCACACAG 


TTACAAAACC 


CAGAGTTGGG 


2580 


GTTGGGGATT 


TAGCTCAGTG 


GCCTAGCAAG 


CGTAAGGCCC 


TGGGTTCGGT 


CCCCAGCCCC 


2640 


GAAAAA 












2646 



(2) INFORMATION FOR SEQ ID NO: 17: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4339 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 



(vii) IMMEDIATE SOURCE: 
(B) CLONE: HW034 

(ix) FEATURE: 

(A) NAME /KEY : CDS 

(B) LOCATION: 91.. 837 

(ix) FEATURE: 

(A) NAME/ KEY: CDS 

(B) LOCATION: 3101.. 4162 

(ix) FEATURE: 

(A) NAME/KEY: misc_f eature 

(B) LOCATION: 3561.. 3862 

(D) OTHER INFORMATION: /label= SAC_23897 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 
GTCGACGGAA GAGCTGTTGT GACTGCGGTA CTGCGGAGGG CTCTCGGCGG CCCGCGACAA 
GGCTGAGTCC CGGGAGGTCC ATGCCCCACC ATG COT TCC TGT GAC ATC TGT GGT 
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Met Leu Ser Cys Asp He Cys Gly 
1 5 

GAA ACT GTA ACC TCA GAA CCA GAT AGG AAG GCT CAC CTA ATT GTT CAC 
Glu Thr Val Thr Ser Glu Pro Asp Arg Lys Ala His Leu He Val His 
10 15 20 

ATG GAA AAT GAA ATT ATC TGT CCG TTT TGC AAG CTG TCG GGT ATA AAT 
Met Glu Asn Glu lie lie Cys Pro Phe Cys Lys Leu Ser Gly lie Asn 
25 30 35 40 

TAC AAT GAA ATG TGT TTT CAT ATT GAA ACC GCT CAT TTT GAA CAG ACT 
Tyr Asn Glu Met Cys Phe His lie Glu Thr Ala His Phe Glu Gin Thr 
45 50 55 

ACQ CCA GAA AAA AGC TTC GAG ACG CTA GCC GCA GTG CAA TAT GAA AAT 
Thr Pro Glu Lys Ser Phe Glu Thr Leu Ala Ala Val Gin Tyr Glu Asn 
50 65 70 

TCA GAC CTC GGT AAT ACC AAG CTG CAC AGT ACA GTG GAA GTT ACC TCA 
Ser Asp Leu Gly Asn Thr Lys Leu His Ser Thr Val Glu Val Thr Ser 
75 80 85 

GGC ATC CAT TCA GCT TGT GCA TCG AAC TTT CCA AAG GAG TCG TCC GAA 
Gly lie His Ser Ala Cys Ala Ser Asn Phe Pro Lys Glu Ser Ser Glu 
90 95 ioo 

AGC CTT TCT AAA GAT AGG ACT TTA AAA CAA GAA GCT TTT TAT ACA GAG 
Ser Leu Ser Lys Asp Arg Thr Leu Lys Gin Glu Ala J£e Tyr J£ ctu 
105 110 115 120 

cfS «** ™* £ CT AGA A** T AC CAG AAA AGC AGA GAA AAG CAG TCC 

Ser Val Ala Glu Ser Arg Lys Tyr Gin Lys Ser Arg Glu Lys Gin Ser 
125 130 i3 5 

GGA TTG TCT GAG GCC CAA GGA TCA ATT TAT GAA ACA ACA TAC AGT CCC 
Gly Leu Ser Glu Ala Gin Gly Ser He Tyr Glu Thr Thr Tyr Ser Pro 
140 145 150 

CCC GAA TGT CCG TTC TGT GGG AGA ATT GAG AGG TAC AGT CAA GAT ATG 
Pro Glu Cys Pro Phe Cys Gly Arg He Glu Arg Tyr Ser Gin Asp Met 
15: » 160 165 

G^u" III £?! S? £ CA ™ G £A T 000 AGC CTT CTA GAA AGT CCG TTA 

Glu He His Val Lys Thr Lys His Ala Ser Leu Leu Glu Ser Pro Leu 

175 180 

GAA GAC TGT CAT CAA CCA CTC TAT GAC TGT CCC ATG TGT GGG CTT GTC 
Glu Asp Cys His Gin Pro Leu Tyr Asp Cys Pro Het c?s gS SE 
185 190 195 200 

TGT ACA AAT TAT CAC ATT CTC CAA GAA CAT GTG GAC TTG CAT TTA GAA 
Cys Thr Asn Tyr His He Leu Gin Glu His Val Asp Leu His Leu Su 
205 210 215 

GAA AGC AGC TTT CAA CAA GTA CTT GTG AGT GGG TAT GTG CCT GTG TGG 
Glu Ser Ser Phe Gin Gin Val Leu Val Ser Gly Tyr Val Pro Val JS 
220 225 230 

ACA CCC GAC CTG CGA TGT CAT CCT CAG GAA GGC CGT ACA ACT CCT TTA 
Thr Pro Asp Leu Arg Cys His Pro Gin Glu Gly Arg Thr Thr Pro Leu 
235 240 245 



162 



210 



258 



306 



354 



402 



450 



498 



546 



594 



642 



690 



738 



786 



834 
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AGA TGAGTTTCTT AGTATCTTGG AACACACCAG TTAAGCTCCC AAGTTTGCTT 



CTATCTCCAC 
TCACATGATT 
CGAGCTATCT 
GCTCTCGAAG 
TCTAAGAGAA 
TCCTGGTAAA 
TGCATCATGT 
TTTCCTTTTC 
CGCTAGGCAA 
TGGAAGATTA 
GTTGTTTAGG 
AAGGGTTTAT 
CAAGGCAGGA 
TGTTTTGCCC 
GTGGCATCAC 
CTTGCCTGCA 
GTAGCTCTGG 
AGCTATCTTC 
GCAGAATGTC 
TGAAACAGTC 
TAGAACCAGT 
TGAATTGAGG 
TTTTCAGAAT 
GGGCAAGTAG 
CAGAATCACT 
GTGCAGCTGC 
TAATCCCTAC 
CTGGTTCTGG 
TTTTCATTTG 
TCTATATCTT 



CTCCCTAATA 
CTGGGCACTG 
CCTGAGCCTG 
GTATAGGGTC 
GTTGTTTATA 
GTAGGAGATT 
ACCTGCTTTG 
CTTTTTTTTC 
GCGCTCTACC 
TTTCACGTCA 
GTTCCTGTTG 
TTTATCCAAA 
ACCTGTAGGC 
TGACTTGTTC 
CTACAAGGAC 
CTCGGTCTTT 
CTAAGTGTAA 
CTTTGAGTAA 
TGATAATTCT 
TCAGGGCATG 
GGGAAGGTGT 
AGTACCTAGA 
AACTGATGGG 
TGTCCAGTGT 
TTGGCTCCTC 
TCCCTGGGTT 
CAGAAATTTT 
CACTTTCAAC 
TATCATTATG 
TCTGTCTGTC 



ATGCTGGAGT 
AGCTTAGGTC 
CTGTCTGTCT 
TTGTATGAGT 
TTTTTATTAC 
TTAGCGTTCG 
CTCATTACCA 
TTTTTTTCGG 
ACTGAGCTAA 
GTGGTCTTCT 
CTGTAATAAA 
CATTTCCAGT 
AGGAGCTGAT 
AGACTCCTTT 
TGGGCCTTCC 
CTTTACTGTT 
ACTCTAGCTT 
GATTGTAAGA 
TACTTTAGGG 
CTAAAAATCA 
AAAATGGGTT 
AGACTCCTAA 
TGCGTTGGCA 
CGCTAGTACT 
CTTCTGGATG 
ATACTCCAGG 
GAGCCTTTGA 
TTTCACTACT 
TGATCTATTC 
TGTCTCTGTG 



TACAATCATG 
CTTATGTCTG 
CTTTAAATAA 
CTCCTAGGGC 
TCTGTCAGTA 
ATGATTTACC 
CAAGAGGAAA 
AGCTGGGGAC 
ATCCCCAACC 
GCTGTCTACC 
ACACTACCAA 
TCACCACTGA 
GCAAAGGCCA 
CTTAGAGCAC 
AACACCAGTC 
GTATTGTTTT 
GTGTCAAGTT 
GGATGCTGTG 
ATAGGCTTTG 
GTTGTGCAGT 
AATCAACAGG 
CTGTTTCTTC 
AACAGAATGA 
ATAGGTGGAC 
GGGCTTTCTT 
ATGTCTGACT 
CCTCACACTG 
GGTTTCTCTG 
CCGTATGATT 
TGTGTGTGTA 



GACAACCATA 
TGAGGCAAGC 
TATTAAATAC 
CACTGAAAAA 
TTTGGGAGTA 
ACCAATAACC 
ATTAGTAGAA 
CGAACCAAGG 
CCAAATTAGT 
CCAGTTGGCA 
CAAGAGTAAC 
GGGAATTCAG 
TGAAGTGCTG 
CCAGGGATAT 
ACTACAGAAC 
TTGAGATGGA 
GACCTAAAAC 
GAAGGGGTAA 
TATTGTATTC 
ACATGCCCAA 
TGTGTGCATT 
TGGGTGTATC 
AGAAGATGTG 
CAGGAAAGCC 
TCCCTGCAGC 
TTTGGATGCT 
GGATATCTGA 
TGTCTCAGGC 
CGTGATATTT 
CATACACACA 



CACACATGTT 
TCTTCCTGAC 
TTCATATAGA 
TAAAGTGACC 
TGTAGACAGC 
GTGATCTCTA 
TATTCTTTTT 
GCCTTGCACT 
AGAATATTGA 
TGCCACTGTC 
CTGGGCAGGA 
GGAAGGGACT 
CTGAGTGGCT 
CAGCCCAGGG 
GCACTACAGA 
GTCTCACTAT 
TAGCCAGCAC 
TTTTTTCAAA 
TATAAACCAA 
GAGTGGAAAC 
GTACTGTCTC 
TCTGAGGATG 
CATTCAGCGT 
GAAGCAGTGG 
TGCTCCCTGG 
GGAACTCTGA 
TTTTGCCAAA 
TATCAAGAAC 
AATAAGGTTT 
CACGCCTGTG 



887 

947 
1007 
1067 
1127 
1187 
1247 
1307 
1367 
1427 
1487 
1547 
1607 
1667 
1727 
1787 
1847 
1907 
1967 
2027 
2087 
2147 
2207 
2267 
2327 
2387 
2447 
2507 
2567 
2627 
2687 
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TGTGGTGTTT 


TCTTATCTTG 


CCCTCAGTCC 


TGTTACTGGA 


GGACCCTGAC 


CAGTAAAGAA 


2747 


TTTAGGAAAC 


AAGACTCAAT 


ACATTACAAT 


ACAATACAAG 


CCATTGACTA 


AAGAATCTGA 


2807 


GTTTCAGTGG 


GAGAGAGGAA 


GCATGGCATT 


ATGCTATAAA 


ATTATTCAGC 


TTGTTCGGGT 


2867 


AAAAACTGTA 


ATGTAATGAA 


ACAGTTGGAG 


GTTGTGAATA 


TATATTTTAT 


ATTATTTATT 


2927 


ACTTTTTTCT 


TTCCATTAGT 


TGTGTTCAGA 


GATATCAATG 


ATAAGGGTCT 


TTCAACCGCT 


2987 


CACCCTTTGT 


TTTTCTCCCT 


AAAGGTGTTA 


GGATGTTCAC 


AATTGCTAGT 


TAATGAGAAA 


3047 


TGCTTGTTAG 


AGCCTAAGCA 


CTGACTCATT 


CGGGTTGTCC 


CTCTCCAATA 


GGC ATG 


3103 



Met 
1 

GAC AGA GTC CAG TGT TCT AGT GAC AGA GAA TTA GCT CAC CAG CTT CAG 3151 
Asp Arg Val Gin Cys Ser Ser Asp Arg Glu Leu Ala His Gin Leu Gin 
5 10 15 

CAA GAA GAA GAG AGA AAG AGG AAA TCT GAA GAA TCA AGA CAA GAA AGG 3199 
Glh Glu Glu Glu Arg Lys Arg Lys Ser Glu Glu Ser Arg Gin Glu Arg 
20 25 30 

GAA GAG TTT CAG AAA TTG CAG CGG CAG TAC GGC TTA GAT AAT TCT GGA 3247 
Glu Glu Phe Gin Lys Leu Gin Arg Gin Tyr Gly Leu Asp Asn Ser Gly 
35 40 45 

GGA TAC AAA CAA CAG CAG CTG CGA CAC ATG GAG CTA GAA GTA ACG AGG 3295 
Gly Tyr Lys Gin Gin Gin Leu Arg His Met Glu Leu Glu Val Thr Arg 
50 55 60 65 

GGA CGA ATG CAT CCA TCT GAG TTC CAC AGC AGA AAA GCT GAC ATG TTG 3343 
Gly Arg Met His Pro Ser Glu Phe His Ser Arg Lys Ala Asp Met Leu 
70 75 80 

GAA TCA ATC GCT GTT GGT ATT GAT GAT GGA AAA ACA AAA ACT TCT GGA 3391 
Glu Ser He Ala Val Gly He Asp Asp Gly Lys Thr Lys Thr Ser Gly 
85 90 95 

ATT ATT GAA GCC CTC CAC AGG TAT TAT CAG AAC ATT GCC ACA GAT GTA 3439 
He He Glu Ala Leu His Arg Tyr Tyr Gin Asn He Ala Thr Asp Val 
100 105 110 

AGG TGT GTG TGG CTG TCT ACA GTG GTG GAT CAC TTT CAT TCA TCT TTT 3487 
Arg Cys Val Trp Leu Ser Thr Val Val Asp His Phe His Ser Ser Phe 
115 120 125 

GGG GAC AAA GGT TGG GGT TGC GGT TAT AGA AAT TTC CAG ATG CTG CTT 3535 
Gly Asp Lys Gly Trp Gly Cys Gly Tyr Arg Asn Phe Gin Met Leu Leu 
130 135 140 145 

TCA TCA TTA CTG CAG AAT GAA GTG TAC AGT GAC TGC TTG AAA GGT ATG 3583 
Ser Ser Leu Leu Gin Asn Glu Val Tyr Ser Asp Cys Leu Lys Gly Met 
150 155 160 

TCA GTT CCT TGT ATT CCA AAA ATT CAG TCC ATG ATT GAA GAT GCG TGG 3631 
Ser Val Pro Cys He Pro Lys He Gin Ser Met He Glu Asp Ala Trp 
165 170 175 

AAT GAA GGT TTT GAT CCT CAG GGG GCC TCT CAA CTT AAT AAC AAG TTG 3679 
Asn Glu Gly Phe Asp Pro Gin Gly Ala Ser Gin Leu Asn Asn Lys Leu 
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180 185 190 

£ AG S? 6 **** 000 ^ ATT GGA GCA TGT GAG ATC TAT ACA CTT CTG 3727 

Gin Gly Thr Lys Ala Trp He Gly Ala Cys Glu He Tyr Thr Leu Leu 
195 200 205 

ACC TCA CTG AAA GTC AAG TGC CGC ATT ATT GAT TTT CAC AAG TCA ACT 3775 
Thr Ser Leu Lys Val Lys Cys Arg lie lie Asp Phe His Lys Ser Thr 
210 215 220 225 

GOT CCT TCA GGC ACA CAC CCT CGC TTA TTC GAG TGG ATA CTG AAC TAT 3823 
Gly Pro Ser Gly Thr His Pro Arg Leu Phe Glu Trp lie Leu Asn Tyr 
230 235 240 

TAT TCC TCA GAG ACG GAA GGG GCG CCA AAG GTC GTG TGT ACA TCC AAA 3 871 

Tyr Ser Ser Glu Thr Glu Gly Ala Pro Lys Val Val Cys Thr Ser Lys 
245 250 255 

CCT CCT GTC TAT CTC CAG CAT CAG GGT CAC AGT CGA ACA GTT GTT GGG 3919 
Pro Pro Val Tyr Leu Gin His Gin Gly His Ser Arg Thr Val Val Gly 
260 265 270 

S? 6 » GA *** AAC CGA ACA TTA TGT TTG CTA ATA TTT GAT CCT 3967 
He Glu Glu Arg Lys Asn Arg Thr Leu Cys Leu Leu He Phe Asp Pro 
275 280 285 

GGA TGT CCT TCT CGA GAA ATG CAG AAG CTG TTA AAG CAA GAC ATG GAG 4015 
Gly Cys Pro Ser Arg Glu Met Gin Lys Leu Leu Lys Gin Asp Met Glu 
290 295 300 305 

GOT GGT AGT CTC AGG CAG CTC CGG AAA TGT GTG GGA AAT CTG AAG CAT 4063 
Ala Gly Ser Leu Arg Gin Leu Arg Lys Cys Val Gly Asn Leu Lys His 
310 315 320 

AAG CAG TAC CAG ATA GTA GCA GTG GAG GGT ATC CTG TCC CCA GAG GAG 
Lys Gin Tyr Gin He Val Ala Val Glu Gly He Leu Ser Pro Glu Glu 
325 330 335 

Ar« a?« t G ° ^ G ? T TCT CAA GTG TTT ACA GCG GAG AAG ATT 4159 

Arg Ala Ala Arg Lys Gin Ala Ser Gin Val Phe Thr Ala Glu Lys He 
J 40 345 350 

CCT TGACCCAGAT ATTTTGTAAT TATCCTTTTT GTTTCCGATA TTGAACTCTG 4212 



ATACAGTTGA AGAATTTGAC TTCACTGAAG TCGGTGATAA ATTATTTAAG TTATAAATGC 
CTGTTGTACT CCTTAGCATT TCGTTGTCTG GTAATCAGTT GAATAAATTC CTTGTTTACA 
TTAAAAA 

(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 249 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 



4111 



4272 
4332 
4339 
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Met Leu Ser Cys Asp lie Cys Gly Glu Thr Val Thr Ser Glu Pro Asp 
15 10 is 

Arg Lys Ala His Leu He Val His Met Glu Asn Glu He He Cys Pro 
20 25 30 

Phe Cys Lys Leu Ser Gly He Asn Tyr Asn Glu Met Cys Phe His He 
35 40 45 

Glu Thr Ala His Phe Glu Gin Thr Thr Pro Glu Lys Ser Phe Glu Thr 
50 55 60 

Leu Ala Ala Val Gin Tyr Glu Asn Ser Asp Leu Gly Asn Thr Lys Leu 
65 70 75 80 

His Ser Thr Val Glu Val Thr Ser Gly He His Ser Ala Cys Ala Ser 
85 90 " 95 

Asn Phe Pro Lys Glu Ser Ser Glu Ser Leu Ser Lys Asp Arg Thr Leu 
100 105 no 

Lys Gin Glu Ala Phe Tyr Thr Glu Ser Val Ala Glu Ser Arg Lys Tvx 
115 120 125 

Gin Lys Ser Arg Glu Lys Gin Ser Gly Leu Ser Glu Ala Gin Gly Ser 
130 135 140 

He Tyr Glu Thr Thr Tyr Ser Pro Pro Glu Cys Pro Phe Cys Gly Arg 
145 150 155 * 160 

He Glu Arg Tyr Ser Gin Asp Met Glu He His Val Lys Thr Lys His 
165 170 175 

Ala Ser Leu Leu Glu Ser Pro Leu Glu Asp Cys His Gin Pro Leu Tyr 
180 185 190 

Asp Cys Pro Met Cys Gly Leu Val Cys Thr Asn Tyr His He Leu Gin 
195 200 205 

Glu His Val Asp Leu His Leu Glu Glu Ser Ser Phe Gin Gin Val Leu 
210 215 220 

Val Ser Gly Tyr Val Pro Val Trp Thr Pro Asp Leu Arg Cys His Pro 
225 230 235 240 

Gin Glu Gly Arg Thr Thr Pro Leu Arg 
245 

(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 354 amino acids 

(B) TYPE: amino acid 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 

Met Asp Arg Val Gin Cys Ser Ser Asp Arg Glu Leu Ala His Gin Leu 
15 10 15 
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Gin Gin Glu Glu Glu Arg Lys Arg Lys Ser Glu Glu Ser Arg Gin Glu 
20 25 30 

Arg Glu Glu Phe Gin Lys Leu Gin Arg Gin Tyr Gly Leu Asp Asn Ser 
35 40 45 

Gly Gly Tyr Lys Gin Gin Gin Leu Arg His Met Glu Leu Glu Val Thr 
50 55 60 

Arg Gly Arg Met His Pro Ser Glu Phe His Ser Arg Lys Ala Asp Met 
65 70 75 80 

Leu Glu Ser He Ala Val Gly He Asp Asp Gly Lys Thr Lys Thr Ser 
85 90 95 

Gly He He Glu Ala Leu His Arg Tyr Tyr Gin Asn He Ala Thr Asp 
100 105 no 

Val Arg Cys Val Trp Leu Ser Thr Val Val Asp His Phe His Ser Ser 
115 120 125 

Phe Gly Asp Lys Gly Trp Gly Cys Gly Tyr Arg Asn Phe Gin Met Leu 
130 135 140 

Leu Ser Ser Leu Leu Gin Asn Glu Val Tyr Ser Asp Cys Leu Lys Gly 
145 150 155 160 

Met Ser Val Pro Cys He Pro Lys He Gin Ser Met He Glu Asp Ala 
165 170 175 

Trp Asn Glu Gly Phe Asp Pro Gin Gly Ala Ser Gin Leu Asn Asn Lys 
180 185 190 

Leu Gin Gly Thr Lys Ala Trp He Gly Ala Cys Glu He Tyr Thr Leu 
195 200 205 

Leu Thr Ser Leu Lys Val Lys Cys Arg He He Asp Phe His Lys Ser 
210 215 220 

Thr Gly Pro Ser Gly Thr His Pro Arg Leu Phe Glu Trp He Leu Asn 
225 230 235 240 

Tyr Tyr Ser Ser Glu Thr Glu Gly Ala Pro Lys Val Val Cys Thr Ser 
245 250 255 

Lys Pro Pro Val Tyr Leu Gin His Gin Gly His Ser Arg Thr Val Val 
260 265 270 

Gly He Glu Glu Arg Lys Asn Arg Thr Leu Cys Leu Leu He Phe Asp 
27 5 280 285 

Pro Gly Cys Pro Ser Arg Glu Met Gin Lys Leu Leu Lys Gin Asp Met 
290 295 300 

Glu Ala Gly Ser Leu Arg Gin Leu Arg Lys Cys Val Gly Asn Leu Lys 
305 310 315 320 

His Lys Gin Tyr Gin He Val Ala Val Glu Gly He Leu Ser Pro Glu 
325 330 335 

Glu Arg Ala Ala Arg Lys Gin Ala Ser Gin Val Phe Thr Ala Glu Lvs 
340 345 350 
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lie Pro 



(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2415 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 

(vii) IMMEDIATE SOURCE: 
(B) CLONE: HW035 



(ix) FEATURE: 

(A) NAME/ KEY: misc_f eature 

(B) LOCATION: complement (1738.. 1910) 
(D) OTHER INFORMATION: /label= SAC_24468 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:20: 

GTCGACGGGC CCTATTCTGC CTCCTTCAGG ATCTGCGTCT TCCAGGGATC CGCTGTTCGA 60 

CTTGGAGGTG CTTCTCTGCA GGTCCCAAGC CTGCCTCCTC ACCCCAGCTT TCCAGTGGAG 120 

GCTGCCCCCT TCCAGGCTTG GCTCAGGAAG CTCCTCATAG GCGGATGCCC ACCCATAAGA 180 

GGCTACATCC CTTTGTGGTC ACATGGCCTG TAGCCTCTTG CTGACATTGC TGGTGTCTTC 240 

TCTGGGGCCC TGGATCCCTG ACAAGCACAG AAATGTGATC TGGTAAAAGG AAAGACTGTG 300 

GCCTGATAGG CCTTTACCTC GTAGGCAGCA GGATTTTCAC TCAGTTTGAC ACAGCATGCA 360 

GTGCTCTGGG AGCAGGACCT CTGCCACCCT GGCATTTTTG GTAGGCTGGT TTGGCTCAAG 420 

GTGCTCAGAC ACTCCCATTG ATGCCTGAGT CGGATATGGG TGGAAATCTG GACTGTTGAT 480 

TATAAGTTAC CAGGTGACCT CATTCCCTCC CTTTGTCTCT TAACTGGTAT CCAGTGAGTT 540 

GCTGATGCCG ACTGGCAAGC AGAGCTGGCC CATGCCTAGA GGAGGAGAGG GACAGGCTTT 600 

CTGAGTGCTT TGATTCTGGC TGGGCTGTAG TGTATAGCTG AGAATGGCCT CCTGCATGGG 660 

TCCAGCCTTT TGGGAGGATG AGGAAGAGGA GGCTGAGGCT AGATGTGTGT TAAACAGGAG 720 

TAGCAGATCT TTGCTTCTGA ACCGGATAGA GCCGTGCTCA AGGCTTTCAG GTTTGCAGTA 780 

TCTTCTTATT GAGTCTGGGG AACGTTCCCT ATTGAGCCAG CCCTGCAGCT CCACTCCCTG 840 

CCCCATTCCC CTGATTTCTT GACAGTAAAT CCTGTGTCCA GTTGTGACTG TTGGCTGTTT 900 

TCCTGATCAT TGTTAATTCC GTTTTCCTAC TGTGAGGGTG TTGTGTTTGA CTTTCTGACA 960 

CAGCGAATGG AGAGCTTGGG CTGAGGCCAG GTCTCCCAAA CTTCTCCGCT AAGTAAACCG 1020 

TAATGTGTGA CTTCCCTCCT ATCTTGACTG CCCGACTCCT TCCTTCCACA TGGACACTCA 1080 
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GATTTCAAAG AAAACCCGCA GGAGGTGGGT CAGTTTCCTC TGTAGAATAT GGTCCCAGAG 1140 

AGTCCTCCCC TCTGTTGTGT CTAGCTGCTA GTCTGGGTGT TGTACCATCA GTGCAGGGAG 1200 

TCCACGGCAA CACTGAGTTA TACAGGGACC ACTACCATCG CCAGGCTTCC TAGCAGGCTG 1260 

TTCTCAGTTC ATCTTTCAGG TCCGTGTGTA CTCTGTGCTA GCAGCAGCCT TGGTGTTCAA 1320 

GGCCAGGATT TGCAGTTTTG ATAAGCTACA GCCCACTTGG AGATGGTGAG GCCACTTCCC 1380 

TGCTACCTCC CTCTCTGGTG TGGCCACTTG GCCAAAGCTC CCATCCACAG CTGGGGTGTC 1440 

TGAGCCACAG CCTGCTACAC TGTTTCCGCT CTGGGATCTT CTGAGTTACT CTTTTAGGCT 1500 

GTGGTTTGGT GAAAGGAACC AACACATTAA CGATCTCCCC CCCCCCCCCC AAAGCCACTG 1560 

AGTAATTCTT GGCATGTTTT CTAAGTGCTG TTGGCTGCCT GGTTTGGGAG CAGTTGTGGA 1620 

GAGGACAGAC TTAGTACCAC ACAGAGGCAT GGGTGTGCGA GCTGCTCACA GCTTCCTTGT 1680 

TTCTGGCATG TGTCCTGGGT GCGTGCTATA CCTCCCGGGC CAGCTGCACA CCTATGTACT 1740 

CTGCCCCTGA GAGGTAGAGT GTTCAGGAAG TGTCCCTGAT AGATTCCCTG TTCTCTGTGG 1800 

CCCCAGTTGG GCTTTTGGCT CCTCCAGCAA CTTCATGCCC CAACCCTACC TTTCCTCCCA 1860 

TTTCTCTCTG TCTGCTAGTC CCTGAAGCCT TTAACCAAAT GGGAATGGGT ACAGAAAGCC 1920 

TTCTCCTGGC AACTTAGGGC AACAGGACAA GGGCTATTGT GTGTCCAACT CTGTTGGTCT 1980 

CCTGGTGCTG AGAGGTGTGT CCAAGCAGAG TTGATCAGCC CCTGCCTGCC CTGCAGGGCT 2040 

GAAGCCAGGG GGAGGTTAGT AGAGAGTACC CTAAATGGGG TGAGGGTCCT TAAACATCCA 2100 

AGTGAGGAGG GGCTGTTGTG TGGCTGCAAG GTGAGACCCT GTCTCCAGCA CCTGGGTAGG 2160 

GCCATCGGGG ATCTGGCTGC AAGGTGAGGT CTGGACTTGG CATCAGTGCA GAGCGATCAA 2220 

GTCACCTGGC CTCAGCAGGC AGCAGCCTTG GTGCCCACCC GTGCCCTGCG CAGTATTTAT 2280 

TGCTAAATTA TTGTCCAGGA GGGGCGGCAC TGGGCCTGGC CCCCCGGGTA TTTATTGCTG 2340 

TACATAGTGT ATGTTTGTGA TATATAAGGT TTTCTTTATT TTGTATATGA TCAATAAACG 2400 
CCTTTTAAAG AAAAA 

(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3531 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 

(vii) IMMEDIATE SOURCE: 
(B) CLONE: HW036 



2415 
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(ix) FEATURE: 

(A) NAME/KEY: misc_f eature 

(B) LOCATION: complement (2541.. 2575) 
(D) OTHER INFORMATION: /label= SAC_24406 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 
GTCGACTGAA AACCACTCAG AATTTTCTCT GG TCCTTAAC AGAAAAGAAT CAAAGTAGGA 
AGCACAGTAG CTGCCACATC TGTTCAACCT CTGGGCCTCC CAAGTACACC AGAGAAAACA 
GATCAATGCT GTTTCTGAAT GATTCTGCTG GTTAAAGTCA TTTATAGAAG ATTCACGATA 
CTGAACAACT ATGGGTTAAG AACTTACTGT GTTTTGTGAA TTGCCCAATG GAGAGAGTGT 
GTTCAAGGAT GACTCTTGAT CAGATTACAC AGTATGTGCA TCCAGGAGGC CACATGAAAT 
CAGCATAGTT AGTAACATAG AATCTGTTGT CAGCCTCCTC CAACTCGTGT CTGTTTCTTG 
GTTTCAATTT TAGGCCAGCC CTCTCCTCAT AAAGGGGTGA TAGTAGCCAC AGTCCACAGC 
CTCAAGGTCT GGTCTCCAAA AAGAAGAGTG TGAGCTGGGC CACAATCATA CCCTCCTGAA 
GAACTCTGGT CTTTTCTGGT CATATGCTCC TGCCCCACCC TTGGTGATCC TGGCTAGATC 
ATGGTCTCCC GTACTGTGCC GACACCATTC ACAGTAACAT AGGATGAGTT ATGGGCCTCA 
TCCCTAAACA CTGGAACTTG CAGCAAATGA TGTAGCAGAT ATTCACTCTA GATAACAAAT 
GTCAGTGACA TGAAGTGGCT AGGAAAGTTT AGAACCCTGA TCATTTTATT TTTACCACTT 
AACACAGTAG AGAGCCATCA GGACTAAATC TTTGGTGTGA TTCTTGCCAT TGAAGTCATT 
ATAGCTTCCA TATTGTTCCG TAGTTCACTG GAAAGTGCAC TTGCTGTTAA AAGTTTTGAA 
GCTGTGAGGA TTCCCTAGGG TGTCACTGAA GTCTGCTCTG TATACTGCCC CCACTGGGAG 
TGCACGGTAC TGCACCCTTA AATCCTGCCG CCTGTCTCCC AAACATGGTG TCTGCGTGCA 
GTGTGGAGTA GTGCGCCTTC TTTTTTTTTT TTTTTTTTTT CCTATTCTTT TTTTGGAGCT 
GGGAACCGAA CCCAGGGCCT TGTGCTTGCT AGGCAAGTGC TCTACCACTG AGCCAAATCC 
CCAACCCCTG TAGTGCGCCT TCTATACTAG AAAGCTTGAC CACTGAGCCA CACCTCCCAC 
TAGTGCTTCA ATGTCAACCG AGAGTAAAAT GTGTTTGTAT GAAATGCCTC CATTTGACTA 
GATAGAGCTT TATTTGGAGA AAGTCACATA TAACATAATT GAACTTTGAA TTATACAATC 
CCGTGGATTT TAGAGTGCTC CTGGAGCAGG TGGCAGTCAC CACTATCTAC TTCCAGAACA 1320 
GTCTCATCCT TTCCAGAAAC CCACACTCTG TCTTTCCTCT ATTCCAGATC TGTTAGACGA 1380 
GTGGAATTAC ATAGTCCGGT CTTTTCTGAG TTCTGTTACT AAGTTTTAAA GGTTTATTCT 
CAGGTAGCAT CAGTCCGTAA TGTATTACTG CTGAATAGTG TTCCGTGTAT ACAGACACCG 
TGTGTGTCTT CTTCCAGCGA GCAGAGGAAC TCTGAGCTGT TTCTACTTTG GGGCTTTTGA 
CTAATGCTAT GAACATCTGT GAAAAAGTTC GAAATGTTTG ATTTAGTACA GACCCTAGTG 
GGGAGCTCCG GGGTCATATT ATGACAGCCT CAATTGTACT TCCTACAGTG GTTTTACCAC 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 



1440 
1500 
1560 
1620 
1680 
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CATTTCCTGC TCTCGTGGGA TCTAGGCTCC AGCATCCCTC ACAACTTTCT GCCTGAGATG 1740 

AAGAGGCATC TGATTGGGAT CTTGGTTTGC ATTTCCCTAA TGTCTAATAA TCTGAGCTTT 1800 

TTTTCATGTG TTCATTGGCT TTCTATGCTG CTTTGCAGAA TGTTTATTTC AGGCTACAGT 1860 

CTGCCTTTCA GCTGGGTTAT CTTTCTGTTT TTCTGTAGGA TTTTTTATTT ACGGTCAACT 1920 

CATCTCTTAA AGATTAATTG GCATTTTTTT TTTTCTCAAC TTGTGGTTTG CCCTTCTCAA 1980 

TGTCTTGATG CTCTCTGAAG CACAGTTTTT ATTTTTATAT ATTGCCTTTC TGCGTCTTAG 2040 

CCCAGGCTGG CCAGGAATCC CCAAAGCCTT GTACACAGAG TGAGCATTTT CACCGCCTCA 2100 

CTTCCCAGAA GGCACACGGC GCTGCTCCTG CCTAGAGCAG TTTTCATATT GGAAAATAAC 2160 

AATGTGTTTT TTCTTTCAGT GTATCCTAAG AAACCATTCC TAACCTGGAC TTACAGAAGC 2220 

TTATGTCTTC ACTAAAGAAT TTTTAAATTT TACCTCTTAC CTGTAGGCCT GTGCTCTCAA 2280 

AATGTTACAC ACCTTTATTC ATGGCCATAT TTAAAATTAA GTAGAAATAG TGAGTATGTA 2340 

GATATGCCTG TATCGTGATG AGCACATTCA CAGTGCAGTG CAGTTGGCCT GCAGCCCAGT 2400 

TCTGCATCCA CCTCAGAAAA TGCCCAGTGA GCCCATCTAA TGACTCCCAG CCCCTGCCAG 2460 

GTCCTCCCTC CCTGCCCTGT CTTGGCATAC TCTCTTCTCC ATGGCAATCA CCATTCTCTG 2520 

TCCTGTTTCT GAATTTGAGT ACCCTAAGTG CCTGGTATAC ATAGATTCTC AGTGCGTGTG 2580 

TGTGTGTGTG TGTGTGTGTG TGTGCGTGCG TGTTCATGTA GCAGAATGCC GTGAAGGCTT 2640 

GTATCATGTA TGAGTTTCCT TTGTAAGGAC AAATAGTACT TTAGACTTTA GTGTGTGTTG 2700 

TATGTATGCT GTATGTTTCT TGTTACCTAC TCATCCAGCC GTGGGTACCT CAGTTGTTCC 2760 

CACCCTGGAG CTCCTGACTA TGCTGTGAAC ATGTATATAC AAATATCCCT CGGATCCTGC 2820 

TTAGCATACA CGCAGCAGAT TGTATGTCTT AGGGTAACTC TGCTGTTATT AGTAGGCACT 2880 

GCCGTAGTCT CCATAGTGAC CACACTATTG TATGTCCTGG CAGCAGTGCA CAGAGATTAG 2940 

AAGTGCCCAT GTCCTTGTGA GTCTTACTCT TTTTTTAGTG GCTGTTCTAG TAGGTGTTAA 3000 

GATCTGCTAG TGCTTCTGCT TCCCATCTCT CTGACTACTG ATGTCACATG TTTTATATGC 3060 

TTGGTGGCCT TCTTTATATA TCGTTTGAAG AAATCTCTTT AGGTCCACTG CCCATTAATA 3120 

GTTGGGTCGT TGGCTCTGGT TGAATTTTTG GTAGTCACAT CCACAAAAGC ATTAACCCAC 3180 

CCAAGGGAGC CTACTACAGC TTGCCCTAGG AAACTTGATT TTAGCTCTTA GCTTTGCTCT 3240 

CTGGTATATT TTGAGTTCAT TTGTGTGTGA GATGTAAGGT CTGACTTTCT ACTCTACCCA 3300 

TGTGGATACC TGGCTTTGCA TGTTGCCCTT GGTGGGAAAA CATCCCGCTG TGCATGGTGG 3360 

GGCTGTTTCT GAGCCTGGTC TCTTCCATGA CCTCTGTCTG CCTTCATGCC AACGCCACTG 3420 

CTTTGATTAC TGGAGAATTA TAATAGAAAT TTGAAATCAG AAAATGTTTG TTCTGTAATA 3480 

AAAATTTGAA ATCAGAAAGT GTTCTATAAT AAAAATTTGA AATCAGAAAA A 3531 

-71- 



SUBSTITUTE SHEET (RULE 26) 



WO 98/53071 



PCT/US98/10547 



(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2127 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 



(vii) IMMEDIATE SOURCE: 
(B) CLONE: HW037 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

<B) LOCATION: 280.. 1422 

(ix) FEATURE: 

(A) NAME/ KEY: raisc_f eature 

(B) LOCATION: 1370.. 1796 

(D) OTHER INFORMATION: /label= SAC_24354 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 

GTCGACCTGC GGCTGGCCGG CGGGGCAGGC CCCGGACCCG TGCGGTTCCT GGGCATGGTG 60 

AGCAAGGGGC TGCTGCGCCT TGTCTCTTCA GTCAACCGCA GGAAGATGAA GCTTCTGCTG 120 

GGCATCGCGC TGTTCGCCTA CGCCGCCTAA CTTGCTCATG ATGGAGTGCC AAGTGCCTCA 180 

GAGAAACAGG CCACCAGAAG AACAGTGCAT TCTCAGTACA AGGGCAGCGT TCAGCATTTG 240 

GATATTGGAG TTGAAGCTGT TTGGGGCAAC TTTGTTAAC ATG AGG TCT ATC CAG 294 

Met Arg Ser He Gin 
1 5 

GAA AAT GGT GAA CTA AAG ATT GAA AGC AAG ATT GAA GAG ATT ATT GAA 342 
Glu Asn Gly Glu Leu Lys He Glu Ser Lys He Glu Glu He He Glu 
10 15 20 

CCA TTA AGA GAG AAA ATC AGA GAT TTG GAA AAA AGT TTC ACC CAG AAA 390 
Pro Leu Arg Glu Lys He Arg Asp Leu Glu Lys Ser Phe Thr Gin Lys 
25 30 ~ 35 

TAC CCA CCA GTA AAG TTT TTG TCA GAA AAG GAC CGG AAG AGG ATT TTG 438 
Tyr Pro Pro Val Lys Phe Leu Ser Glu Lys Asp Arg Lys Arg He Leu 
40 45 50 

ATC ACT GGA GGT GCG GGC TTT GTG GGC TCC CAT CTA ACT GAC AAA CTC 486 
He Thr Gly Gly Ala Gly Phe Val Gly Ser His Leu Thr Asp Lys Leu 
55 60 65 

ATG ATG GAT GGC CAT GAG GTG ACC GTG GTG GAC AAC TTC TTC ACA GGC 534 
Met Met Asp Gly His Glu Val Thr Val Val Asp Asn Phe Phe Thr Gly 
70 75 80 85 

AGG AAG AGA AAT GTG GAA CAC TGG ATT GGC CAT GAG AAC TTC GAG CTG 582 
Arg Lys Arg Asn Val Glu His Trp He Gly His Glu Asn Phe Glu Leu 
90 95 100 
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ATT AAC CAT GAT GTG GTA GAG CCA CTC TAC ATC GAA GTT GAC CAG ATC 630 
He Asn His Asp Val Val Glu Pro Leu Tyr He Glu Val Asp Gin He 
105 HO 115 

TAC CAT CTG GCT TCT CCA GCC TCC CCT CCA AAC TAC ATG TAC AAC CCC 678 
Tyr His Leu Ala Ser Pro Ala Ser Pro Pro Asn Tyr Met Tyr Asn Pro 
120 125 130 

ATC AAG ACC CTG AAG ACA AAT ACA ATT GGA ACA CTA AAC ATG TTG GGA 726 
He Lys Thr Leu Lys Thr Asn Thr He Gly Thr Leu Asn Met Leu Gly 
135 140 145 

CTG GCA AAG CGT GTG GGT GCC CGT CTG CTC CTA GCC TCC ACA TCT GAG 774 
Leu Ala Lys Arg Val Gly Ala Arg Leu Leu Leu Ala Ser Thr Ser Glu 
150 155 160 165 

GTA TAT GGA GAT CCT GAG GTC CAC CCT CAA AGT GAG GAC TAC TGG GGC 822 
Val Tyr Gly Asp Pro Glu Val His Pro Gin Ser Glu Asp Tyr Trp Gly 
170 175 180 

CAC GTG AAT CCC ATA GGA CCC CGA GCC TGC TAT GAC GAG GGC AAA CGT 870 
His Val Asn Pro He Gly Pro Arg Ala Cys Tyr Asp Glu Gly Lys Arg 
185 190 195 

GTT GCG GAA ACC ATG TGC TAT GCC TAC ATG AAG CAG GAA GGT GTG GAG 918 
Val Ala Glu Thr Met Cys Tyr Ala Tyr Met Lys Gin Glu Gly Val Glu 
200 205 210 

GTG CGG GTG GCC AGG ATC TTC AAC ACC TTT GGG CCA CGA ATG CAC ATG 966 
Val Arg Val Ala Arg He Phe Asn Thr Phe Gly Pro Arg Met His Met 
215 220 225 

AAT GAT GGG AGG GTG GTC AGC AAC TTC ATC TTA CAA GCG CTA CAA GGG 1014 
Asn Asp Gly Arg Val Val Ser Asn Phe He Leu Gin Ala Leu Gin Gly 
230 235 240 245 

GAG CCG CTC ACA GTA TAT GGA TCT GGG TCT CAG ACA CGG GCA TTC CAG 1062 
Glu Pro Leu Thr Val Tyr Gly Ser Gly Ser Gin Thr Arg Ala Phe Gin 
250 255 260 

TAT GTT AGC GAT CTA GTG AAT GGC CTG GTA GCA CTG ATG AAC AGC AAT 1110 
Tyr Val Ser Asp Leu Val Asn Gly Leu Val Ala Leu Met Asn Ser Asn 
265 270 275 

GTC AGC AGC CCT GTC AAC CTG GGA AAT CCA GAA GAA CAC ACA ATC CTG 1158 
Val Ser Ser Pro Val Asn Leu Gly Asn Pro Glu Glu His Thr He Leu 
280 285 290 

GAA TTT GCT CAG TTA ATT AAA AAC CTT GTT GGT AGT GGA AGT GAA ATT 1206 
Glu Phe Ala Gin Leu He Lys Asn Leu Val Gly Ser Gly Ser Glu He 
295 300 305 

CAG TTT CTC TCT GAA GCT CAG GAT GAT CCA CAG AAA AGA AAA CCA GAC 1254 
Gin Phe Leu Ser Glu Ala Gin Asp Asp Pro Gin Lys Arg Lys Pro Asp 
310 315 320 325 

ATC AAA AAA GCA AAA CTG ATG CTG GGG TGG GAG CCT GTG GTT CCA TTG 1302 
He Lys Lys Ala Lys Leu Met Leu Gly Trp Glu Pro Val Val Pro Leu 
330 335 340 

GAG GAA GGA TTG AAC AAA GCC ATC CAC TAT TTC CGG AAG GAA CTA GAG 1350 
Glu Glu Gly Leu Asn Lys Ala He His Tyr Phe Arg Lys Glu Leu Glu 
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345 350 355 

TAC CAG GCT AAT AAC CAG TAC ATC CCT AAA CCC AAG CCT GCC AGA GTG 1398 
Tyr Gin Ala Asn Asn Gin Tyr He Pro Lys Pro Lys Pro Ala Arg Val 
360 365 370 

AAG AAG GGC CGG ACG CGC CAC AGC TGAGTTAGCC TTGGGATGCG AGACTCTATT 1452 
Lys Lys Gly Arg Thr Arg His Ser 
375 380 

TTACTTTATA AGGTGGACTT TTGTGGGATT TTTTTTTTTT TTTTAAGACT TAAACAGGTG 1512 

TCATGAAGAA CAAACTGGAA TTTTATTCTG AAGCTTGCTT TAAAGACACT GATGTGCCTA 1572 

AAAGCTCCCT TGAACTCTGC AGACTTTGCC TTGCACTTTT TAACTCTGTC TTTTTTATGC 1632 

ACAACAGCCT AGATGCATTC TCTGCTATTT TCAGGTTTTT TTATCTTGCT GTTAGAGTGT 1692 

ATGCTGTAAC TGTCACTGAC AGTTTTATTT ACTGGTTTCT TTGTGAAGCT GAAAAGAAAC 1752 

ATTAAATGGG GTGGAAAATG CCAATTTTAT TTATAAAAGT GAGTACTTTA TAAATGAGAT 1812 

GTTACACTAT GCATAAAGAA TACAAACCTA GAGGTATGGC CAGCTGGGCA GTGCACCAGT 1872 

GTTTATTTGG GGACAGATGA AAGAACTCCA TTGGAAAGCT TTGCATTTCT TTTTTAATTC 1932 

TGAATTTTCT GAAGGTCTAG TTTTCAGTTA CAGACTTGAC TTTGAAACAT CCCTGTTGGT 1992 

TCTTGATCAA AGATATTTGA AATCACTACT GTGTTGTGCT GCATATTGGG GTGGGGGTGG 2052 

GGACAATGTT AACATATTCT TGGTTAACCA TGGTTAAATA TGCTATTTTA ATAAAATATT 2112 

GAAACTCGTC AAAAA 2127 

(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 81 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 

Met Arg Ser He Gin Glu Asn Gly Glu Leu Lys He Glu Ser Lys He 
1 5 io 15 

Glu Glu He He Glu Pro Leu Arg Glu Lys lie Arg Asp Leu Glu Lvs 
20 25 30 

Ser Phe Thr Gin Lys Tyr Pro Pro Val Lys Phe Leu Ser Glu Lys Asp 
35 40 45 

Arg Lys Arg He Leu He Thr Gly Gly Ala Gly Phe Val Gly Ser His 
50 55 60 

Leu Thr Asp Lys Leu Met Met Asp Gly His Glu Val Thr Val Val Asp 
65 70 75 80 

Asn Phe Phe Thr Gly Arg Lys Arg Asn Val Glu His Trp He Gly His 
85 90 95 
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Glu Asn Phe Glu Leu lie Asn His Asp Val Val Glu Pro Leu Tyr He 
100 105 110 

Glu Val Asp Gin He Tyr His Leu Ala Ser Pro Ala Ser Pro Pro Asn 
115 120 125 

Tyr Met Tyr Asn Pro He Lys Thr Leu Lys Thr Asn Thr He Gly Thr 
130 135 140 

Leu Asn Met Leu Gly Leu Ala Lys Arg Val Gly Ala Arg Leu Leu Leu 
145 150 155 " 160 

Ala Ser Thr Ser Glu Val Tyr Gly Asp Pro Glu Val His Pro Gin Ser 
165 170 175 

Glu Asp Tyr Trp Gly His Val Asn Pro He Gly Pro Arg Ala Cys Tyr 
180 185 190 

Asp Glu Gly Lys Arg Val Ala Glu Thr Met Cys Tyr Ala Tyr Met Lys 
195 200 205 

Gin Glu Gly Val Glu Val Arg Val Ala Arg He Phe Asn Thr Phe Gly 
210 215 220 

Pro Arg Met His Met Asn Asp Gly Arg Val Val Ser Asn Phe He Leu 
225 230 235 240 

Gin Ala Leu Gin Gly Glu Pro Leu Thr Val Tyr Gly Ser Gly Ser Gin 
245 250 255 

Thr Arg Ala Phe Gin Tyr Val Ser Asp Leu Val Asn Gly Leu Val Ala 
260 265 270 

Leu Met Asn Ser Asn Val Ser Ser Pro Val Asn Leu Gly Asn Pro Glu 
275 280 285 

Glu His Thr He Leu Glu Phe Ala Gin Leu He Lys Asn Leu Val Gly 
290 295 300 

Ser Gly Ser Glu He Gin Phe Leu Ser Glu Ala Gin Asp Asp Pro Gin 
305 310 315 320 

Lys Arg Lys Pro Asp He Lys Lys Ala Lys Leu Met Leu Gly Trp Glu 
325 330 335 

Pro Val Val Pro Leu Glu Glu Gly Leu Asn Lys Ala He His Tyr Phe 
340 345 350 

Arg Lys Glu Leu Glu Tyr Gin Ala Asn Asn Gin Tyr He Pro Lys Pro 
355 360 365 

Lys Pro Ala Arg Val Lys Lys Gly Arg Thr Arg His Ser 
370 375 380 

(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1785 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 

(vii) IMMEDIATE SOURCE: 
(B) CLONE: HW038 

(ix) FEATURE: 

(A) NAME/ KEY: mis cofeature 

(B) LOCATION: 1440.. 1714 

(D) OTHER INFORMATION: /label= SAC_24332 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 



GTCGACGGTT 


GAGTCTGATG 


AGGAAGATCA AGAAAAGACT GAGGAGACTC CTTCCCAGCT 


60 


GGACCAGGAT 


ACTCAAGTAC 


AGGACATGGA TGAGGGTTCA GACGATGAAG AAGAAGGACA 


120 


GAAAGTGCCT 


CCACCCCCAG 


AGACACCCAT GCCTCCACCT CTGCCCCCAA CACCAGACCA 


180 


AGTCATTGTG CGAAAGGACT ATGATCCCAA AGCTTCCAAG CCCCTTCCTC CAGCTCCTGC 


240 


TCCTGATGAG 


TATCTTGTGT CCCCCATAAC TGGGGAGAAG ATTCCTGCCA GCAAAATGCA 


300 


AGAACACATG 


CGCATCGGGC 


TTCTTGACCC CCGCTGGCTG GAGCAACGGG ATCGCTCCAT 


360 


CCGAGAGAAG 


CAGAGCANTG 


ATGAGGTGTA CGCACCAGGT CTGGATATTG AGAGCAGCCT 


420 


AAAGCAGTTG 


GCTGAGCGAC 


GTACTGACAT CTTTGGGGTA GAAGAGACAG CCATTGGTAA 


480 


GAAGATTGGT 


GAAGAGGAGA 


TCCAGAAGCC AGAGGAAAAG GTGACTTGGG ATGGCCATTC 


540 


AGGCAGCATG 


GCCCGGACCC 


AGCAGGCTGC CCAGGCCAAC ATCACCTCCA GGAGCAGATT 


600 


GAGGCCATCC 


ATAAGGCCAA 


GGGCCTGGTG CCAGAAGATG ACACCAAAGA GAAGATTGGC 


660 


CCTAGCAAGC 


CCAATGAAAT 


CCCTCAGCAG CCACCGCTCC ATCTTCAGCC ACCAATATCC 


720 


CCAGCTCAGC 


GCCCCCCATC 


ACGTCATTCC CCGGCCACCT GCGATGCCAC CTCCAGTCCG 


780 


CACCACTGTC 


GTATCTGCAG 


TGCCTGTCAT GCCTCGGCCC CCCATGGCAT CTGTGGTCCG 


840 


GCTGCCCCCA 


GGCTCAGTGA 


TTGCCCCGAT GCCACCCATC ATCCACGCAC CTAGGATCAA 


900 


CGTGGTACCC 


ATGCCTCCTG 


CAGCACCNCC CATCATGGCA CCCCGNCCAC CTCCCATGAT 


960 


TGTGCCAACA 


GCCTTTGTGC 


CTGCTCCCCC CTGTGGCTCC AGTGCCAGCC CCAGCTCCAA 


1020 


TGCCTCCCGT ACACCCCCCA CCTCCTATGG AAGATGAGCC TCCATCCAAG AAACTAAAGA 


1080 


CCGAGGACAG 


CCTCATGCCT 


GAGGAAGAGT TCCTGCGCAG GAACAAGGGT CCAGTATCCA 


1140 


TCAAGGTGCA 


GGTACCCAAC ATGCAGGATA AGACAGAATG GAAGCTGAAT GGGCAGGGGC 


1200 


TGGTCTTCAC 


TCTCCCGCTC ACAGACCAGG TCTCTGTCAT CAAAGTGAAG ATTCATGAAG 


1260 


CNACGGGCAT GCCTGCAGGG AAGCAGAAAC TACAGTATGA AGGCATCTTC ATCAAGGACT 


1320 


CCAACTCCCT GGCTTACTAC AACATGGCCA GTGGTGCTGT CATCCACTTG GCCCTCAAGG 


1380 
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AAAGAGGCGG CCGGAAGAAG TAGCCTCGAG AGATACTGCA AGTGGACTCC TGCCATTGTA 1440 

CCTCTTTCCA CCACCAATTC CGGGAGTTTC CCTGGGACCT CGGGCACAGC CTGGGCATTT 1500 

GTCTCCCTCT AAGGCAGTCT ATGGAATGGA TATCTGTTTA GATTTGGATT GGAGGTCCTT 1560 

CAAGGAAGCC CATTAGGAAT TGAAAACCCA GGGTTCTCCC ACAGGAGAGC TTCATAATAC 1620 

CAAGTAATGT TTGGCTAATA GAGTATGGTT ACTACCATTC ATCATTGTGT ATCATGCTTT 1680 

GGCCTCTAAT AAACCTCTTG AAGCCAGGCA TGGTACACAC ATGCCTGTAA TCCCGGGACT 1740 

TGGGGGGCGG GCAGAAGAAT TAAATGTTCA TCTTGGCTGT AAAAA 1785 
(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1054 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 

(vii) IMMEDIATE SOURCE: 
(B) CLONE: HW039 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION: 231.. 617 

(D) OTHER INFORMATION: /label= SAC_24045 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 

TTTTTCAGAA TTGGCTGACT TTAATTTGGA AACAGCTTTT GGCTACAGAC ATTCAGGTTT 60 

AAGGCACATT CCAAACTACA AGGATCCCTG CCACTGGCAC GGGGCCTGGC TTTAATTCTT 120 

GCATGGTTTT ACATGGAAAG TGGAACAAAT GATTTTCTGG CCTCTTCTAA AAAAGAGTGT 180 

GAGTTTGGCA AAAACAAAAC TCATGACACA TTCTCACCAG ACACCACAGT ACCGTTACAA 240 

GACCTCCCTT TTGTCCTAGG CAAAGACGGG TGGCCCGCTC ACCTACCATT CTCTCCAGAA 300 

ACATGTATAC AGTAAGCATG CTTGTATGCG TCTGTCTGGC AGCGATGGTT AAAAAAAGTT 360 

TTCAAATAAA AATACAAAAA TCACACGTCT ATAACAAAGC TAAGAGTCTT AGCCACAAGC 420 

AGCGAGGAAT GAGTGTAGGA ACCAGGAGAG ATAAACATCA ATTCTGCACA GCAAGGGGCT 480 

GTGAGGGCCA GGGAGATGGT CACCTGAGAA CCTGGCCCAC CTGGGCCCTT CGGCACAATA 540 

CGTGGATGAG GGGGGACCAG AAGGAAAAGG TCATGGCTTT GGGGATGGGT GGATGGCTGG 600 

GGGGTGCACT TGTGGGTACT AGATAGATTG GGGGAACTGG TGTGCTCTTC GGGACGTTTC 660 
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CTGGCCACTG CCAGCAGGGG GCCAGCAGGG GGCACAGGTG AGCCACTCCC ACCGCTCAGT 720 

CGTTGCCACC ACAGATCTTC AGCAGGATCT TCCGATAATC CCCAGAAGTG TCTCCCGTGA 780 

TATCGTGGTA CAGTGACTTG CCGTACATCC GCTTATACTC TGCTCGGATA TCCAGCAGGT 840 

CAAGCTCGCT TCGAGACACC ATGATGCGAA TCAGGGTCCG GTCTTTTGTT CCTGCTCCCC 900 

CTCATAGCCT TGTTGAGCCT TTCGGCAAAG AAGGCAGGGG TGTTCTTGAG GCATTTCACC 960 

ACAGCCAGCA TGCCCTGCTC CAGGTCCCCA GACATCTCCG GCAGATGCTC TTCTCGATGT 1020 
CTCGGCCTGT CATCCTCTGA TACTCGTTGA AACT 

1054 

(2) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS • 

(A) LENGTH: 3306 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 

(vii) IMMEDIATE SOURCE: 
(B) CLONE: HW040 

(ix) FEATURE: 

(A) NAME/ KEY: CDS 

(B) LOCATION: 481. .2433 

(ix) FEATURE: 

(A) NAME/ KEY: misc_f eature 

(B) LOCATION: complement (2395.. 2588) 

(D) OTHER INFORMATION: /label- SAC_24520 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 
GTCGACGCCT GGTGTGGCTG GCGGCACCAG CTGAGTATTC CCTGAAAGAA TGAAGATGAC 60 
TGGCCTAGAG GAGTCAGGAA TGTGACTGCT GGGAGATCCA GCTAGAGCAA CCATACCTGC 120 
TCTTCCAGGA CCTTCCCCCT AACAGCTAAG GAGCCTTAGG TCTGGGAGAG GCAGGCACCT 
GGCTGCAGAA GGGGAAACAA CCTTTTTCTC TTTCTCTCCT CCACTACTCT CCTGATGGCT 
GCAGAAGAAG AAAGATCTAC CACGGGAAAC TCGGAGAGCT AGTCGGTTAA CTGAGCTGTT 
GGATCTCCAC GGCACGAGGC AAATCCGTGA TGTGACTTTT TGTTTTTTCT CCTAGGTCAG 
CTGAGGGAAC AGCTCAGTTA CCTTAAGGGC GATAACTTTT TCAGGTTTAC TTGTGCCGAT 
TGCTCGGCAG ACGGCAAGGA GCAGTCCGAG AGGCTGAAGC TGACATGGCA GCAAGTGGTC 

Leu SI I AC ^ C TCT CTG GAA GGA AGT GGA CGT CAA GGC 

Met Leu Ala Met Tyr Asn Leu Ser Leu Glu Gly Ser Gly Arg Sn QlS 

5 10 15 

TAG TTC CGG TGG AAA GAA GAT ATC TGT GCT TTT ATT GAG AAA CAT TGG 



180 
240 
300 
360 
420 
480 
528 



576 
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Tyr Phe Arg Trp Lys Glu Asp lie Cys Ala Phe lie Glu Lys His Trp 
20 25 30 

ACT TTT TTA CTA GGA AAT AGG AAA AAG ACT TCG ACG TGG TGG AGC ACA 624 
Thr Phe Leu Leu Gly Asn Arg Lys Lys Thr Ser Thr Trp Trp Ser Thr 
35 40 45 

GTA GCA GGT TGC CTC AGC GTG GGA AGT CCT GTT TAT TTC CGT TCA GGT 672 
Val Ala Gly Cys Leu Ser Val Gly Ser Pro Val Tyr Phe Arg Ser Gly 
50 55 60 

GCT CAG GAA TTT GGA GAG CCT GGA TGG TGG AAA CTT GTT CAT AAC AGA 720 
Ala Gin Glu Phe Gly Glu Pro Gly Trp Trp Lys Leu Val His Asn Arg 
65 70 75 80 

CCC CCA ACA ATG AGA CCA GAG GGA GAG AAG CTG TCT GCC TCC ACT TTG 768 
Pro Pro Thr Met Arg Pro Glu Gly Glu Lys Leu Ser Ala Ser Thr Leu 
85 90 95 

AAA GTG AAA GCT TCA AAA CCA ACA CTG GAT CCC ATC ATT ACC GTT GAA 816 
Lys Val Lys Ala Ser Lys Pro Thr Leu Asp Pro lie He Thr Val Glu 
100 105 110 

GGA CTC AGA AAA CGA GCA AGC CGG AAT CCT GTG GAA TCA GCC ATG GAA 864 
Gly Leu Arg Lys Arg Ala Ser Arg Asn Pro Val Glu Ser Ala Met Glu 
115 120 125 

TTG AAA GAG AAG CGG TCT CGA ACA CAG GAA GCT AAA GAC ATC AGA AGA 912 
Leu Lys Glu Lys Arg Ser Arg Thr Gin Glu Ala Lys Asp He Arg Arg 
130 135 140 

GCA CAG AAA GAA GCA GCC GGC CTC CTT GAC AGA AGC ACC TCT TCT ACC 960 
Ala Gin Lys Glu Ala Ala Gly Leu Leu Asp Arg Ser Thr Ser Ser Thr 
145 150 155 160 

CCT GTC AAG TTC ATA AGC CGA GGC CGG AGG CCG GAT ATG ATT TTG GAG 1008 
Pro Val Lys Phe He Ser Arg Gly Arg Arg Pro Asp Met He Leu Glu 
165 170 175 

AAA GGA GAA GTG ATT GAC TTT TCA TCC CTG AGC TCC TCG GAC CGT ACC 1056 
Lys Gly Glu Val lie Asp Phe Ser Ser Leu Ser Ser Ser Asp Arg Thr 
180 185 190 

CCC CTC ACA AGC CCC TCT CCT TCT CCC TCT CTG GAT TTC TCT GCC CCA 1104 
Pro Leu Thr Ser Pro Ser Pro Ser Pro Ser Leu Asp Phe Ser Ala Pro 
195 200 205 

GGG ACA CCA GCG TCA CAT TCT GCC ACG CCT AGC TTG CTC TCT GAA GCA 1152 
Gly Thr Pro Ala Ser His Ser Ala Thr Pro Ser Leu Leu Ser Glu Ala 
210 215 220 

GAC CTC ATC CCA GAC GTG ATG CCA CCC CAA GCT CTA TTT CAT GAT GAT 1200 
Asp Leu He Pro Asp Val Met Pro Pro Gin Ala Leu Phe His Asp Asp 
225 230 235 240 

GAT GAG CTG GAA GGA GAT GGA GTC ATA GAC CCA GGG ATG GAG TAC ATC 1248 
Asp Glu Leu Glu Gly Asp Gly Val He Asp Pro Gly Met Glu Tyr He 
245 250 255 

CCA CCC CCA GCT GGG TCA GCT TCT GGG CTG ATG GGG AGC AGA AAG AAG 1296 
Pro Pro Pro Ala Gly Ser Ala Ser Gly Leu Met Gly Ser Arg Lys Lys 
260 265 270 
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GTC AGA GCT GCA GAG CAG ATA AAA CAG GAA GTC GAC AGC GAG GAG GAG 1344 
Val Arg Ala Ala Glu Gin He Lys Gin Glu Val Asp Ser Glu Glu Glu 
275 280 285 

AAG CCA GAC AGG ATG GAT GGA GAT AGC GAA GAC ACA GAT TCA AAC ACT 1392 
Lys Pro Asp Arg Met Asp Gly Asp Ser Glu Asp Thr Asp Ser Asn Thr 
290 295 300 

TCT TTG CAC ACT AGA GCT CGA GAA AAG AGG AAG CCA CCC CTG GAG AAG 1440 
Ser Leu His Thr Arg Ala Arg Glu Lys Arg Lys Pro Pro Leu Glu Lys 
305 310 315 320 

GAC ACG AAG CCC AAG GGG CCC AGG TAT ACG CCT GTG AGC ATC TAT GAG 1488 
Asp Thr Lys Pro Lys Gly Pro Arg Tyr Thr Pro Val Ser He Tyr Glu 
325 330 335 

GAG AAG CTG CTG CTC AAG AGG CTG GAG GCA TGC CCT GGT GCC GTG GCC 1536 
Glu Lys Leu Leu Leu Lys Arg Leu Glu Ala Cys Pro Gly Ala Val Ala 
340 345 350 

ATG ACT CCA GAA GCC CGG AGG CTG AAA CGG AAG CTG ATC GTC CGG CAA 1584 
Met Thr Pro Glu Ala Arg Arg Leu Lys Arg Lys Leu He Val Arg Gin 
355 360 365 

GCA AAA AGG GAT CGG GGC TTA CCA CTG TTT GAC TTG GAC GAG GTT GTG 1632 
Ala Lys Arg Asp Arg Gly Leu Pro Leu Phe Asp Leu Asp Glu Val Val 
370 375 380 

AAC GCA GCA CTT CTG TTA GTG GAT GGG ATT TAT GGA GCC AAA GAT GGA 1680 
Asn Ala Ala Leu Leu Leu Val Asp Gly He Tyr Gly Ala Lys Asp Gly 
385 390 395 400 

GGA GCT TCC CGG CTT CCA GCT GGA CAA GCT ACA TAC CGG ACC ACC TGC 1728 
Gly Ala Ser Arg Leu Pro Ala Gly Gin Ala Thr Tyr Arg Thr Thr Cys 
405 410 ~ 415 

CAG GAC TTC AGG ATC CTT GAC CGG TAC CAG ACT GCC TTA CCA GCC AGG 1776 
Gin Asp Phe Arg He Leu Asp Arg Tyr Gin Thr Ala Leu Pro Ala Arg 
420 425 430 

AAA GGA TTC CGG CAC CAG ACC ACC AGA TTT TTG TAT CGT CTG GTG GGA 1824 
Lys Gly Phe Arg His Gin Thr Thr Arg Phe Leu Tyr Arg Leu Val Glv 
435 440 445 

TCA GAA GAT CTG GCT GTG GAC CAA AGT ATT GTC AGC CCT TAC ACT TCT 1872 
Ser Glu Asp Leu Ala Val Asp Gin Ser He Val Ser Pro Tyr Thr Ser 
450 455 460 

CGG ATC TTG AAA CCT TAT ATC AGG CGT GAT TAT GAG ACA AAG CCA CCC 1920 
Arg He Leu Lys Pro Tyr He Arg Arg Asp Tyr Glu Thr Lys Pro Pro 
465 470 475 480 

AAA CTA CAG CTG CTG TCC CAG ATT CGC TCC CAC CTG CAC AAG AGT GAC 1968 
Lys Leu Gin Leu Leu Ser Gin He Arg Ser His Leu His Lys Ser Asp 
485 490 495 

CCT CAC TGG ACA CCT GAA CCT GAT GCA CCT CTC GAT TAC TGC TAT GTC 2016 
Pro His Trp Thr Pro Glu Pro Asp Ala Pro Leu Asp Tyr Cys Tyr Val 
500 505 510 

CGA CCA AAT CAC ATC CCA ACG ATC AAC TCC ATG TGT CAG GAA TTT TTC 2064 
Arg Pro Asn His He Pro Thr He Asn Ser Met Cys Gin Glu Phe Phe 
515 520 525 
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TGG CCT GGC ATT GAC CTG TCT GAG TGT CTG CAG TAT CCA GAC TTC AGT 2112 
Trp Pro Gly lie Asp Leu Ser Glu Cys Leu Gin Tyr Pro Asp Phe Ser 
530 535 540 

GTC GTA GTC CTT TAT AAA AAA GTC ATT GTT GCC TTT GGC TTC ATG GTT 2160 
Val Val Val Leu Tyr Lys Lys Val He Val Ala Phe Gly Phe Met Val 
545 550 555 560 

CCC GAT GTG AAG TAC AAC GAA GCT TAC ATT TCA TTT CTG CTT GTT CAT 2208 
Pro Asp Val Lys Tyr Asn Glu Ala Tyr He Ser Phe Leu Leu Val His 
565 570 575 

CCC GAG TGG AGG AGA GCA GGG ATT GCC ACG TTC ATG ATC TAT CAT CTG 2256 
Pro Glu Trp Arg Arg Ala Gly He Ala Thr Phe Met He Tyr His Leu 
580 585 590 

ATT CAG ACG TGC ATG GGC AAG GAT GTG ACT CTT CAC GTC TCG GCG AGC 2304 
He Gin Thr Cys Met Gly Lys Asp Val Thr Leu His Val Ser Ala Ser 
595 600 605 

AAC CCT GCC ATG CTG CTG TAC CAG AAG TTT GGC TTC AAG ACA GAG GAG 2352 
Asn Pro Ala Met Leu Leu Tyr Gin Lys Phe Gly Phe Lys Thr Glu Glu 
610 615 620 

TAT GTT TTG GAT TTT TAT GAT AAG TAT TAC CCA CTG GAG AGT ACA GAG 2400 
Tyr Val Leu Asp Phe Tyr Asp Lys Tyr Tyr Pro Leu Glu Ser Thr Glu 
625 630 635 640 

TGT AAA CAT GCA TTC TTC CTG AGG CTC CGA CGC TGAGGTGAAC TCAGTTCTTC 2453 
Cys Lys His Ala Phe Phe Leu Arg Leu Arg Arg 
645 650 



CCAGCAAACA 


GCAGATGCTG 


TCAGAGAGTG 


GCCTCAGAAG 


TCGTGCAGGC 


CCTTCTCCTC 


2513 


GAAGTGGGCC 


CTGGCCTCCT 


GTGAGTAATG 


GCCTCCACAG 


GGTCCAGCTG 


GGTGAGTGAA 


2573 


GCCATGTGCA 


GAGGTACAAG 


TGGCTGTTCA 


ATCCAGAAAC 


CATCCAGCAG 


GCCCTGCGTG 


2633 


GCCTTCCAGC 


AGTATCTTGG 


ACTCCACTGT 


GGACCAGAAC 


CAAGTGACTC 


TGCCATTCTT 


2693 


TTTCCACACA 


ATGAGTGGGC 


ACAAAAGAGC 


TTCTGAGCTG AAGCATTTGA GAATGGGCCT 


2753 


GCTCCTCCTA 


TGTGAGAGCT 


CAGGAAGTAT 


GAAAGATGTT AAAGACTGCC AGGGCCCTCA 


2813 


CTGTCCTGGG 


CCGAGAGGTT 


ATGAGGTGTC 


AGGAATGACT 


CCAACCCTGT 


CTCAGCCTGT 


2873 


GCTGTGCTGT 


CATCGAACCC 


CAGAGGATTC 


ATGGGGCACT 


GAAGTCCTTT 


TGTCTCTTAC 


2933 


TGACTTTCGT 


CCTGGTCAGG 


AAAAAGATAC 


TCGGTGTCTG 


TTTCCAGACT 


GCTGCTGCTT 


2993 


TACTTATCTA 


CCTGTCAGCA 


CCCCTTGCAG 


GGACATTGAC 


CTGGTACAAG 


AAGTTTACTC 


3053 


TTGGCATGGC 


CTGGAAAGTA 


GCCCGAGAGG 


AACGGTTGAG 


CTTGATACAT 


TAGATGCCTT 


3113 


CCCAGGCCCT 


TCTCATGCTC 


AGGACAGTTG 


CACCACAGAA 


GTGGAGGTGT 


GGAATGGCGC 


3173 


AGGCAGGCCT 


GGGCGCCCTG 


TCCACCTCAT 


GAAGCTGCTT 


CCTGGGTTGC 


ACATGTTGGA 


3233 


GTTGGACTAC 


AGGTATCTAA 


GTGCATTCCA 


AAAGCATCAG ATATCTCCAA TAAAAATCCT 


3293 


TACCATACAA 


AAA 










3306 
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(2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 651 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 

Met Leu Ala Met Tyr Asn Leu Ser Leu Glu Gly Ser Gly Arg Gin Gly 
15 10 15 

Tyr Phe Arg Trp Lys Glu Asp He Cys Ala Phe He Glu Lys His Tro 
20 25 30 

Thr Phe Leu Leu Gly Asn Arg Lys Lys Thr Ser Thr Trp Trp Ser Thr 
35 40 45 

Val Ala Gly Cys Leu Ser Val Gly Ser Pro Val Tyr Phe Arg Ser Gly 
50 55 60 

Ala Gin Glu Phe Gly Glu Pro Gly Trp Trp Lys Leu Val His Asn Arg 
65 70 75 80 

Pro Pro Thr Met Arg Pro Glu Gly Glu Lys Leu Ser Ala Ser Thr Leu 
85 90 95 

Lys Val Lys Ala Ser Lys Pro Thr Leu Asp Pro He He Thr Val Glu 
100 105 no 

Gly Leu Arg Lys Arg Ala Ser Arg Asn Pro Val Glu Ser Ala Met Glu 
115 120 125 

Leu Lys Glu Lys Arg Ser Arg Thr Gin Glu Ala Lys Asp He Arg Arg 
130 135 140 

Ala Gin Lys Glu Ala Ala Gly Leu Leu Asp Arg Ser Thr Ser Ser Thr 
145 150 155 160 

Pro Val Lys Phe He Ser Arg Gly Arg Arg Pro Asp Met He Leu Glu 
165 170 175 

Lys Gly Glu Val He Asp Phe Ser Ser Leu Ser Ser Ser Asp Arg Thr 
180 185 190 

Pro Leu Thr Ser Pro Ser Pro Ser Pro Ser Leu Asp Phe Ser Ala Pro 
195 200 205 

Gly Thr Pro Ala Ser His Ser Ala Thr Pro Ser Leu Leu Ser Glu Ala 
210 215 220 

Asp Leu He Pro Asp Val Met Pro Pro Gin Ala Leu Phe His Asp Asp 
225 230 235 240 

Asp Glu Leu Glu Gly Asp Gly Val He Asp Pro Gly Met Glu Tyr He 
245 250 255 

Pro Pro Pro Ala Gly Ser Ala Ser Gly Leu Met Gly Ser Arg Lys Lys 
260 265 270 
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Val Arg Ala Ala Glu Gin He Lys Gin Glu Val Asp Ser Glu Glu Glu 
275 280 285 

Lys Pro Asp Arg Met Asp Gly Asp Ser Glu Asp Thr Asp Ser Asn Thr 
290 295 300 

Ser Leu His Thr Arg Ala Arg Glu Lys Arg Lys Pro Pro Leu Glu Lys 
305 310 315 320 

Asp Thr Lys Pro Lys Gly Pro Arg Tyr Thr Pro Val Ser He Tyr Glu 
325 330 335 

Glu Lys Leu Leu Leu Lys Arg Leu Glu Ala Cys Pro Gly Ala Val Ala 
340 345 350 

Met Thr Pro Glu Ala Arg Arg Leu Lys Arg Lys Leu He Val Arg Gin 
355 360 365 

Ala Lys Arg Asp Arg Gly Leu Pro Leu Phe Asp Leu Asp Glu Val Val 
370 375 380 

Asn Ala Ala Leu Leu Leu Val Asp Gly He Tyr Gly Ala Lys Asp Gly 
385 390 395 40Q 

Gly Ala Ser Arg Leu Pro Ala Gly Gin Ala Thr Tyr Arg Thr Thr Cvs 
405 410 415 

Gin Asp Phe Arg He Leu Asp Arg Tyr Gin Thr Ala Leu Pro Ala Ara 
420 425 430 

Lys Gly Phe Arg His Gin Thr Thr Arg Phe Leu Tyr Arg Leu Val Gly 
435 440 445 

Ser Glu Asp Leu Ala Val Asp Gin Ser He Val Ser Pro Tyr Thr Ser 
450 455 460 

Arg He Leu Lys Pro Tyr He Arg Arg Asp Tyr Glu Thr Lys Pro Pro 
465 470 475 480 

Lys Leu Gin Leu Leu Ser Gin He Arg Ser His Leu His Lys Ser Asd 
485 490 495 

Pro His Trp Thr Pro Glu Pro Asp Ala Pro Leu Asp Tyr Cys Tyr Val 
500 505 510 

Arg Pro Asn His He Pro Thr He Asn Ser Met Cys Gin Glu Phe Phe 
515 520 525 

Trp Pro Gly He Asp Leu Ser Glu Cys Leu Gin Tyr Pro Asp Phe Ser 
530 535 54 Q 

Val Val Val Leu Tyr Lys Lys Val He Val Ala Phe Gly Phe Met Val 
545 55 ° 555 560 

Pro Asp Val Lys Tyr Asn Glu Ala Tyr He Ser Phe Leu Leu Val His 
565 570 575 

Pro Glu Trp Arg Arg Ala Gly He Ala Thr Phe Met He Tyr His Leu 
580 585 590 

He Gin Thr Cys Met Gly Lys Asp Val Thr Leu His Val Ser Ala Ser 
595 600 605 
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Asn Pro Ala Met Leu Leu Tyr Gin Lys Phe Gly Phe Lys Thr Glu Glu 
610 615 620 

Tyr Val Leu Asp Phe Tyr Asp Lys Tyr Tyr Pro Leu Glu Ser Thr Glu 
625 630 635 640 

Cys Lys His Ala Phe Phe Leu Arg Leu Arg Arg 
645 650 

(2) INFORMATION FOR SEQ ID NO: 28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 4234 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 

(vii) IMMEDIATE SOURCE: 
(B) CLONE: HW041 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION: 3407.. 3482 

(D) OTHER INFORMATION: /label= SAC_24317 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28: 



GTCGACCCTC 


CTTCTGCCCT 


GGCTCCTGTT 


GATGAGCACA 


GTGCCGTCCC 


TGATCAGTTC 


60 


TTTCTTCCTG 


TGAATTTGCT 


TCGTAGCCCA 


TCAGGCTGGT 


AGACTGGCAG 


CACGGTCAGT 


120 


GTCACACGAG 


GGACAGGACT 


TCCCTGAAGT 


GAAGCAGCAA 


TGTCATGGAA 


GCAAACCCCT 


180 


GTGAGCGAGA 


GACAGGCCCC 


GAGGCTGCTG 


CCACAAGGTC 


TCATGTTCAT 


TGCTGGGGAC 


240 


ACTAGCTTCC 


TACAAACCCC 


AGGAGACGCT 


GCCCATAGCA 


CGGGTGGCCC 


TTCAGGCTCC 


300 


TGAACTATCT 


ATCCCCAGAT 


CAGTAGGATT 


TGTTTTCCAA 


GTCCCCTTCC 


TCATTCCTTC 


360 


TTTGCTCAAG 


AGAAAAAAGA 


AAAGTGGTAA 


CTCTGTGACT 


CTAAGTCCCT 


CTTAAATGCA 


420 


GCGCTGAATC 


TAGCTTGGCT 


CGGAGATCTC 


TGACTTGTCC 


CTAGGGTGAG 


ACTGTGGCCT 


480 


CCCCAGGTTG 


AGGTGGCACG 


TGTATTGTGA 


GAGATTAGCC 


TGGAGAGCAG 


AGAAGGGGGG 


540 


AGGGGGGATA 


GGGAGGAGAG 


AGCAATCGTG 


TCTCCCTACA 


GGACCCTGGG 


CCATCTGACA 


600 


AGACAGAGGA 


GGTGGGACTG 


GCTTCCCCAT 


GCTGCACATG 


GCAGTGGGAA 


CCAGGCTTGG 


660 


AGTCCCCCAC 


CCCCTCCTTA 


CCTCTTCTCT 


CAGACCTTGT 


AGCCCTAGCT 


TGCCCTGTGC 


720 


TCTGCAGGGA 


AGGGGATGAG 


AGGACAGCAT 


AAGAAGTCTT 


GCCAGTTGAG 


GTGGGGCTGG 


780 


AAGATGAAGC 


AGGTGGGTGT 


AGGAATCATT 


TTGGATAGGG 


CATGTTGTTT 


GGTTGACGGC 


840 


AACTAATTCT 


CTTAAGAGCA 


TATTGTCTCC 


AGTTAATGTT 


GGAGACAACC 


CAGAAAGGGA 


900 
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AGGGTATAGA GATGGGTAAT TAATGAGATA TTCCAGCCCT GGAGCAATGT CTGAGCCACC 960 

CCAAACTTAG GCTGTCTCCC AGCATAGGAG TGGGGCACTG CCCCTTACTC AAATTCTAGT 1020 

CTAAGTGTTC CAGACACATT GTACACTTGG ACAGGAAATC TCCTGTTCCA GCATACCCTC 1080 

TCCCCCATAC CCTCTCCCCC ATACCCCCTC CCCCATACCT TCTCCCCCAT ACTGTTAGTA 1140 

CGTTTCAGTT GTTCAAAAGC AAAGGGGGCG CAGACTAGGC TTTAACTGCA TGTGGAGTGT 1200 

AGATTTTGCC TAAGAATCCA TGTTGAAGAA GAGAGAGGCT ATGTCTTTAA GGTCGAGGCC 1260 

CTTCAGTCTT CTGTCCTTGT AATTCTCTTG AAGCGTGAGC CCCAGGCACT CAGAAGCCAT 1320 

GTTGTACTCC GGGAAGTTCA TGGGACCTGA ACTCCTTCAA CACCGGGAAC AACAGTACTG 1380 

CATAGGTACT TTGGTACTGA CTGTGCAAGC ACTAGAAAAA GTACCCGTTG AATGGCCCTC 1440 

ACAGCATCAG GGAAACGAGG CCGAGTTCAC TCTCCTGATT GCTCAGATGC AGAAGCTGGA 1500 

CCCGTGAGGT GTACCAGCCC ACCTCCAAGC TTGACACAGA GGCTAGCACT TAAAGCAAAG 1560 

ACTCAGAAGA GGAGTCTCAG TCACAGGCTG CACAGCCTTG AACATGATAT CTAGAACTAT 1620 

GCCTGGGTGA TCGGTGGCCC AACTTGTGAC ACTTGTGTGC ACATAGGGAA GCTGACGCTC 1680 

CTCTCGGGGC CTGAAAACAG GACTAAACCC AAACCCCATA TAACTAACCA TACCACTGCC 1740 

ATCCACAGGC GTGGATGGAA GTCTTCACGG TCTGTTACAA GCACAAGTAC AAAGCCGATA 1800 

AAAATAAGTC AAAGCCCTTC CTCGTGGTGC AGAGGGCTCC TCCTCAGATC TCTTGGAGCC 1860 

TTTGATCATA GAGCTTGACG GTCTCCTCTC GGGATAAAGG ACAGGTAAAG GCTGTGGCCT 1920 

TCAGCGGTGA CTGCCCCACA CCCTGCTTTC CTTAGGGGTG GAGACAAGGA ATGGTGGGGC 1980 

CAGTGTTGTT CAGAGAGAAG TCTCTCTTTG GAGCCCATGT ATGGGGCTTG ACCTTTTGCT 2040 

GTGAGTGCAT TTAGAGGCAA GTGAATTCTG GCCTCAGTAC AGAGAACCCA GCGGTTTCAG 2100 

CACTCAAGAA AGAAGCCGCA GTGTCCCCTA CAGAGAGGCT CTGTCCTGCT TGGCCCCAGC 2160 

CACCTGCGCC ACACATTTAA TAAAGTGGCT GAAACTCCTC CTTTCCAAGA GCTAGTGTGC 2220 

TGGGTCCTGT CCTCCCCACA GCCTTCTGAG TCCCAGAGAC TTGTGTGAGC AGTCGGATAA 2280 

GGCCAATCTG GGAGACAGGG GATGACCATG ACACAAGGTG TTTCCAGGCA GGTCCCTTGT 2340 

GCGTAGGCTG CAGAGAGCTG TCCTGACTGG GAGCTTGGGT TGTACGAACA CATGGTGCCT 2400 

TCTTTGTCTG TGCTGAACGA GTAGGGTGTG TTTCTCTGTG TTCTCTGAGT AAGATCACAC 2460 

CCTCTTTAAG CAAAGGGGTG TTGTGGATAT TAGAAGCAAG ATTGATAAAT GTTCTTAAAT 2520 

GGTGGTGCTA GAGAAAAGTC AGTCCCCGTA ACTTCCTAAA AGCTGAGTCA GCCAGTCTCT 2580 

AAGTATGTAC ATGTTGGCTT TCTTCTATTT TTTTTCTAGT TAATCAGTGT GTTTTAAATG 2640 

TTTTTAGTTG ATGTGAGGAT TTTTTTAAAG GCATTTGTGT GTGGTGTGGC TCCATGGGAG 2700 

ATAGGCTGTG TAATAAGAAC CCTCAGAAGC CAGCCAGTCA TCCAAGGGGC GCCCAAGTTC 2760 

AGTGGATTTC TTCAGGAATT GGAGAGCAGT ATTTTTCCCT CTTTTCAGTA GTATGCCTGT 2820 
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TCTCATTTAG TAATTTAATG GTATATTGCA TTATATGTGA ATGCATACAT AAACGGTGGA 2880 

ATTTCTCTTA ATATCCAAAA AAGTCCTTCT GTTGCCCTCT GAGCTCTCTC CTTGGGGAGG 2940 

ACTGCAGGGC TCACTGTGTA AAATGGTTGT GGACTCCTCT AGCAGAATGG CTTCCAAGGC 3000 

CTGGCCCATT CCTCAGATTC CCTTTGAAGT GAGAGGTCAT TGTCTTTTGA GAATGGATTT 3060 

AAGTTTTGGA AATACCCCAC GCTGTTCACA GTGCAAGCGA CTGAACACAG AACAATTAGT 3120 

CAGAATCAGA ATCCCAGGGC CATCAAGATG ACTCCACGGA AAAGGGACTG CCACCAACCA 3180 

GAGCCAAGAC CTGAGTTTGG TCCCCAGGAA CCCCATGGCA GCAGGAGAGA GTCAGCCGAT 3240 

GCAAGTGTCA CACACACACA TAAACACACA CATACACATG CAACACATAC ACACAAACAC 3300 

ATACATACAC ACACACAAAC ACATACACAC ATGCACACAC ACACACACAC ACACACACAC 3360 

AGAGTTTTTG GAAGAATATC TATGATGTAG GAAAAGGTTT GTTAGTACCT GCAACCTCTA 3420 

GGGCCTCATT TCCAAGCCTG CTGCTTCTCT TGCCAGGACT TATACGTGTG CAACCCCTGC 3480 

GTACACTACA GACCTAAGTA CATACTCGCT AACATGAACA CCCAGTAGGT ATTCTGTAAA 3540 

ACGGATATAC TTAAGTGCAT GTTATTGTCG TGTGTGCCGC GACTGAGCTG TGTCTTTGTC 3600 

ATGAAACTTT GCTTCTTTGC CAAATTAAGA CTCTGCCCTC CCTCCGAGGA AGGAAATATA 3660 

CAGCAGTTTC TGGAATCAGC TTCTCAAGTT TTCCCGCCTG TGTTGTGTTT ACTTGTGGCT 3720 

TTGCCTGAGA GTCTGTCCTC ACGTACACAC CTGTGAGTAT TTATTACAAG GCAGAACGGG 3780 

GACAGGCGTA GCCATAGTAC TTGTGTTTGA ATGTTTCCAG GGAGGGACAA TGGAGCCCCC 3840 

TCCTCAGTGA GGTGCTCACA GAATCTGTGT CAAATTCTGT GAACGGGGTC CAACACTAGT 3900 

ACCAGCTTCA CCCCAACCCT GAGTGCCCTG GAGCCACTGC CCTCTCTACT GCCTTGTGTG 3960 

TGCATGGGTG TGGTTGTGTG TGTCTCTGTT GGGGCATGTG CCGCATTTCG CAGTGGACTT 4020 

TGGTATCCAC ATGTAACCTT TGGCATCTGT GTTCCTGGAC CGCATTGACA GATCTGTGGT 4080 

GTGCAGATCT GTGTAGATGG GTCTGTGTGC AAAGTGGGGA GTCCCCTGAA AGTGACTATG 4140 

GATATTGTTG GTCAGCCAAA GACTTTCCTA TTCTTTGCTG CTTAAACTTT GTGCCTTAAT 4200 

ATTGTAAATA TAAACGGTTA AAACAAAAGT CGAC 4234 
(2) INFORMATION FOR SEQ ID NO: 29: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1792 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 

(vii) IMMEDIATE SOURCE: 
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(B) CLONE: HW042 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 8.. 889 

(ix) FEATURE: 

(A) NAME/KEY: misc_f eature 

(B) LOCATION: complement (1409.. 1618) 
(D) OTHER INFORMATION: /label= SAC_24017 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29: 

GTCGACC AAG GAT GGT GGA AGT TTC AGT GGT ACT GGT GGC CTT CCA GAT 49 
Lys Asp Gly Gly Ser Phe Ser Gly Thr Gly Gly Leu Pro Asp 
15 10 

TAT TCT GCC CCC AAT CCC ATT AAA GTG ACC CAT CGG TGC TAC ATC CTT 97 
Tyr Ser Ala Pro Asn Pro He Lys Val Thr His Arg Cys Tyr He Leu 
15 20 25 30 

GAG AAT GAC ACA GTC CAG TGT GAC TTG GAC CTG TAC AAG TCC CTG CAG 145 
Glu Asn Asp Thr Val Gin Cys Asp Leu Asp Leu Tyr Lys Ser Leu Gin 
35 40 45 

GCT TGG AAA GAC CAC AAA TTG CAC ATC GAC CAT GAG ATT GAA ACC CTG 193 
Ala Trp Lys Asp His Lys Leu His He Asp His Glu He Glu Thr Leu 
50 55 60 

CAG AAC AAA ATT AAG AAC CTT CGA GAA GTC AGG GGT CAC CTG AAG AAG 241 
Gin Asn Lys He Lys Asn Leu Arg Glu Val Arg Gly His Leu Lys Lys 
65 70 75 

AAG CGG CCA GAA GAA TGT GAC TGT CAC AAA GTC AGT TAC CAC AGC CAA 289 
Lys Arg Pro Glu Glu Cys Asp Cys His Lys Val Ser Tyr His Ser Gin 
80 85 90 

CAC AAA GGC CGC CTC AAG CAC AAG GGC TCC AGC CTG CAC CCT TTC AGG 337 
His Lys Gly Arg Leu Lys His Lys Gly Ser Ser Leu His Pro Phe Arg 
95 100 105 HO 

AAG GGT CTG CAG GAG AAA GAC AAG GTG TGG CTG TTG CGG GAG CAG AAA 385 
Lys Gly Leu Gin Glu Lys Asp Lys Val Trp Leu Leu Arg Glu Gin Lys 
115 120 125 

CGC AAG AAG AAA CTG CGC AAG CTG CTG AAG CGA CTG CAG AAC AAT GAC 433 
Arg Lys Lys Lys Leu Arg Lys Leu Leu Lys Arg Leu Gin Asn Asn Asp 
130 135 140 

ACG TGC AGC ATG CCG GGC CTC ACG TGC TTT ACT CAC GAC AAC CAC CAC 481 
Thr Cys Ser Met Pro Gly Leu Thr Cys Phe Thr His Asp Asn His His 
145 150 155 

TGG CAG ACC GCG CCG CTC TGG ACG TTG GGA CCC TTC TGC GCC TGC ACC 529 
Trp Gin Thr Ala Pro Leu Trp Thr Leu Gly Pro Phe Cys Ala Cys Thr 
160 165 170 

AGC GCC AAC AAT AAC ACG TAC TGG TGC CTG CGG ACC ATC AAC GAG ACC 577 
Ser Ala Asn Asn Asn Thr Tyr Trp Cys Leu Arg Thr He Asn Glu Thr 
I 75 180 185 190 

CAC AAC TTC CTC TTC TGT GAA TTC GCA ACT GGC TTC ATA GAA TAC TTT 625 
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His Asn Phe Leu Phe Cys Glu Phe Ala Thr Gly Phe He Glu Tyr Phe 
195 200 205 

GAC CTC AGT ACG GAC CCC TAC CAG CTG ATG AAC GCG GTC AAC ACA CTG 673 
Asp Leu Ser Thr Asp Pro Tyr Gin Leu Met Asn Ala Val Asn Thr Leu 
210 215 220 

GAC AGG GAC GTT CTC AAC CAG CTG CAC GTC CAG CTC ATG GAG CTA AGG 721 
Asp Arg Asp Val Leu Asn Gin Leu His Val Gin Leu Met Glu Leu Arg 
225 230 235 

AGC TGC AAA GGC TAC AAG CAG TGT AAC CCC CGG ACC CGC AAT ATG GAC 769 
Ser Cys Lys Gly Tyr Lys Gin Cys Asn Pro Arg Thr Arg Asn Met Asp 
240 245 250 

CTG GGG CTT AGA GAC GGA GGA AGC TAT GAG CAA TAC AGG CAG TTT CAG 817 
Leu Gly Leu Arg Asp Gly Gly Ser Tyr Glu Gin Tyr Arg Gin Phe Gin 
255 260 265 270 

CGT CGA AAA TGG CCA GAA ATG AAG AGA CCC TCT TCC AAA TCA CTG GGA 865 
Arg Arg Lys Trp Pro Glu Met Lys Arg Pro Ser Ser Lys Ser Leu Gly 
275 280 285 

CAG CTA TGG GAA GGT TGG GAA GGC TAATCGGCCA TGGAGAGGAA CCTCCAAAAC 919 
Gin Leu Trp Glu Gly Trp Glu Gly 
290 



CAGAGGCCTC 


CTGTGGCTGC 


CCAGGCGTGC AAAAATCACC 


CAATTCCGAG GAGATCGATG 


979 


CTGGAACTGG 


GAGACTTGAC 


AGGAGGCAGG GTCTGCACTT 


GGGACAGGAA ATCCCGAGGA 


1039 


GAACGCCAAG 


ACTTCCAGAG 


GCTCATTGTT TTGCCCCGCT 


TTGCTTCGGA TTGAACCTCA 


1099 


CCGGCCGCAC 


AAGGATGCGT 


CCTGACACCT GGAGTCTCCG 


CTCACCCTTT CAGAGGCTCA 


1159 


CCAAGACAAA 


GGAACTAATT 


CCCATGGACT TTCCTCCAGA GATGGAAATT GCTGGGATTT 


1219 


ACCCATCCCC 


TCCCCTGTAC 


CCCCTTCCCC ACTAGGGAAA 


CAAGCTTGTT TTAACCTTCT 


1279 


TATTCTTTGG 


AGAAAGCACG 


GATGTCCCAG GTGCTGTCAA CAGCGCAGTC TTGACAATGG 


1339 


TCTATAGCAC 


AAAACAGTAC 


CATTCACCTG GCTGGATGAG 


CCGGCTGCCC CAGAAGCTGC 


1399 


CTTCACTGTA 


CATATGTGAC 


TGCTCACATG TAACCAACAC 


AGGGACTTGT AGGGGGAATC 


1459 


TCACTAATAC 


GAAATCCCAT 


TTTCAAGAGT CGCGGTGTCA ATAAACACTC TGTGGCTGGT 


1519 


GTCAAGGGTA 


ATCCCTTGAG 


CTTTCAGACA TTTCTGTTCC 


TGCCCAGGAT TCGTTCCTTT 


1579 


GTTATCCATA 


TCCATCCCAG 


AACTGATGTT TTCTAAGGTA 


CTGAAACCCC AAGTTGATGT 


1639 


GTGTCCTGTG 


TTTTAATGAC 


ATTGTATTTG TAAAGCAATT 


TTGTAGTATA AAAGTACCAT 


1699 


CTTCCAGTGT 


TCACATCCCC 


CCCAGCCAAT GACTAGCTCT 


TGGTATGAAA AAAAAAATCT 


1759 


TTGAATTTTT 


GTAAAAAAAA 


AAAAAAAGTC GAC 




1792 



(2) INFORMATION FOR SEQ ID NO: 30: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 294 amino acids 

(B) TYPE: amino acid 
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(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:30: 

Lys Asp Gly Gly Ser Phe Ser Gly Thr Gly Gly Leu Pro Asp Tyr Ser 
15 10 15 

Ala Pro Asn Pro lie Lys Val Thr His Arg Cys Tyr lie Leu Glu Asn 
20 25 30 

Asp Thr Val Gin Cys Asp Leu Asp Leu Tyr Lys Ser Leu Gin Ala Trp 
35 40 45 

Lys Asp His Lys Leu His lie Asp His Glu lie Glu Thr Leu Gin Asn 
50 55 60 

Lys lie Lys Asn Leu Arg Glu Val Arg Gly His Leu Lys Lys Lys Arg 
65 70 75 *" 80 

Pro Glu Glu Cys Asp Cys His Lys Val Ser Tyr His Ser Gin His Lys 
85 90 95 

Gly Arg Leu Lys His Lys Gly Ser Ser Leu His Pro Phe Arg Lys Gly 
100 105 110 

Leu Gin Glu Lys Asp Lys Val Trp Leu Leu Arg Glu Gin Lys Arg Lys 
115 120 125 

Lys Lys Leu Arg Lys Leu Leu Lys Arg Leu Gin Asn Asn Asp Thr Cys 
130 135 140 

Ser Met Pro Gly Leu Thr Cys Phe Thr His Asp Asn His His Trp Gin 
145 150 155 160 

Thr Ala Pro Leu Trp Thr Leu Gly Pro Phe Cys Ala Cys Thr Ser Ala 
165 170 175 

Asn Asn Asn Thr Tyr Trp Cys Leu Arg Thr lie Asn Glu Thr His Asn 
180 185 190 

Phe Leu Phe Cys Glu Phe Ala Thr Gly Phe lie Glu Tyr Phe Asp Leu 
195 200 205 

Ser Thr Asp Pro Tyr Gin Leu Met Asn Ala Val Asn Thr Leu Asp Arg 
210 215 220 

Asp Val Leu Asn Gin Leu His Val Gin Leu Met Glu Leu Arg Ser Cys 
225 230 235 240 

Lys Gly Tyr Lys Gin Cys Asn Pro Arg Thr Arg Asn Met Asp Leu Gly 
245 250 255 

Leu Arg Asp Gly Gly Ser Tyr Glu Gin Tyr Arg Gin Phe Gin Arg Arg 
260 265 270 

Lys Trp Pro Glu Met Lys Arg Pro Ser Ser Lys Ser Leu Gly Gin Leu 
275 280 285 

Trp Glu Gly Trp Glu Gly 
290 
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(2) INFORMATION FOR SEQ ID NO: 31: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2261 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 



(vii) IMMEDIATE SOURCE: 
(B) CLONE: HW043 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 8.. 634 

(ix) FEATURE: 

(A) NAME/ KEY: raisc_f eature 

(B) LOCATION: 582.. 808 

(D) OTHER INFORMATION: /label= SAC_24533 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31: 

GTCGACG GCT CCA CCA GGT TCA AGA CAC AAA CCT GTG CCT ACC AAG TCA 49 
Ala Pro Pro Gly Ser Arg His Lys Pro Val Pro Thr Lys Ser 
1 5 10 

ACA CCA GCT GGC ACA GAA CTA GGT AGC ACA GGA AGC GAC GGG GGA GAT 97 
Thr Pro Ala Gly Thr Glu Leu Gly Ser Thr Gly Ser Asp Gly Gly Asp 
15 20 25 30 

GCA GTA GAA TAC AAC CCC AAC CTC CTG GAT GAC CCA CAG TGG CCG TGT 145 
Ala Val Glu Tyr Asn Pro Asn Leu Leu Asp Asp Pro Gin Trp Pro Cys 
35 40 45 

GGG AAG CAC AAG CGT GTA CTC ATC TTT GCC TCA TAC ATG ACC ACT GTT 193 
Gly Lys His Lys Arg Val Leu He Phe Ala Ser Tyr Met Thr Thr Val 
50 55 60 

ATA GAG TAT GTG AAG CCT GCA GAC CTC AAA AAG GAC ATG AAC GAG ACC 241 
He Glu Tyr Val Lys Pro Ala Asp Leu Lys Lys Asp Met Asn Glu Thr 
65 70 75 

TTC AGG GAG AAG TTC CCT CAT ATC AAA CTG ACA TTG AGC AAA ATT AGG 289 
Phe Arg Glu Lys Phe Pro His He Lys Leu Thr Leu Ser Lys He Arg 
80 85 90 

AGT TTA AAA CGG GAG ATG CGG AAC CTT TCT GAA GAG TGC AGC TTG GAG 337 
Ser Leu Lys Arg Glu Met Arg Asn Leu Ser Glu Glu Cys Ser Leu Glu 
95 100 105 110 

CCT GTG ACT GTG TCC ATG GCC TAT GTG TAC TTT GAG AAG CTT GCA CTG 385 
Pro Val Thr Val Ser Met Ala Tyr Val Tyr Phe Glu Lys Leu Ala Leu 
115 120 125 

CAG GGC AAG CTC AAC AAA CAG AAC CGC AAA CTG TGT GCT GGC GCC TGT 433 
Gin Gly Lys Leu Asn Lys Gin Asn Arg Lys Leu Cys Ala Gly Ala Cys 
130 135 140 
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GTT CTG TTG GCT GCC AAG ATC AGC AGT GAC CTC CGC AAG AGT GAA GTG 481 
Val Leu Leu Ala Ala Lys lie Ser Ser Asp Leu Arg Lya Ser Glu Val 
145 150 155 

AAG CAG CTT ATT GAC AAG CTG GAG GAA AGG TTC CGG TTC AAC AGG AAG 529 
Lys Gin Leu lie Asp Lys Leu Glu Glu Arg Phe Arg Phe Asn Arg Lys 
160 165 170 

GAT CTC ATT GGA TTT GAG TTC ACG GTG CTT GTG GCT TTG GAA CTG GCC 577 
Asp Leu lie Gly Phe Glu Phe Thr Val Leu Val Ala Leu Glu Leu Ala 
175 180 185 190 

CTG TAC CTC CCG GAG AAC CAA GTA TTA CCT CAC TAC AGA CGC CTC ACC 625 
Leu Tyr Leu Pro Glu Asn Gin Val Leu Pro His Tyr Arg Arg Leu Thr 
195 200 205 

CAG CAG TTC TAGCCCAGAA CAGTGCACCC CACAGGGAGC CTGCTTTGGC 674 
Gin Gin Phe 



AGGAAGGCGA 


GTAGCTTGGC 


TAGAGCACTG 


ACTAGTAGTG 


TCCCTGTCCT 


TCCTCTAGTA 


734 


ACTGCACTTG 


CCCCTCAGTC 


CAGGGCTGCA 


CTTAGCTTCC 


CTTGGAAAGA 


GGCACTGCTC 


794 


TTATTTTGGA 


GTGTACCCCA 


TCTTCACCTT 


GAGGCCCTGT 


TGAGGGTTTT 


CCTGAAGCAA 


854 


ATGAACATCG 


TGGGGCTTGG 


TTTGCTTTTC 


ATGTGCCTGA 


GACCAGCCTG 


GTTTATGTTG 


914 


TCAATCCTGA 


GGTCCAGCTT 


TCATTTTGCC 


TAGCCATGAA 


ACCATTCCTG 


TAGAGACCTA 


974 


GGTGTCCAGA 


CCTTCTGAAC 


ATACTGGAAT 


TGCCAGGCTT 


CTGTCTCCTG 


GTGCCTCCAC 


1034 


CCAGGCAGAG 


CTCCCCCATC 


AGCACTTTTT 


TTTGGTCAAG 


ATGCAGGAAA 


GGGCAGTCTG 


1094 


CTCACACTGG 


ACCTTGTTAA 


AGCATCTGCC 


ACTGTGAAAC 


CCCTGTGCCA 


CCACATACAG 


1154 


TGTTTGGCCA 


ACACCACCAC 


CCACCCCAAC 


CCCTGCCTCC 


TGCCAGCACT 


TATCCCTGCG 


1214 


GTACTGGACC 


ATTTAAGACA 


ACCGCTGACT 


GCGGGCTCTT 


GACTTTCCTG 


AGCAATATCA 


1274 


TGATGGAGAA 


AAGGTATTAA 


CTTGAGATTT 


GGTTTCTATA 


ACAGAAAAGA 


CGGCCCATTA 


1334 


AGGCCTGCTG 


CTGACTCTGC 


ACCACACTTG 


ATCCTCAAGC 


AGGAAGGTAC 


TGACCCTGCC 


1394 


ACCTAACAGC 


ACATCAGTGA 


CTCCTCACAA 


ATAGATGCCA 


TCCAGGCCAA 


CTTGCCTGTG 


1454 


TATTGACTGT 


GCCACTGTCT 


TCCTACTGGC 


ACCAGACAAG 


AGCCATCCTC 


CACCTTCCCA 


1514 


GGCCTGTTAT 


TCCCTACAAC 


TGGTCACCAG 


CCTGGCAGAT 


GCCAGTCCCA 


CTGCTCTCAA 


1574 


GTGCTTGGTT 


CCACAGCCAC 


TCATCCCAGA 


AATGCTTACC 


CTCTCAAGCC 


TGACACCACT 


1634 


CAACTACCTA 


ATGGATGTTA 


GTAGCTTAAA 


CAGAGACCTT 


GAAGGATAGT 


AGAAGGATAG 


1694 


TAAACTCTAG 


CCTTCTACCC 


TGCCCATGCT 


TATGCTGGTG 


TAAGCACGCT 


ATCCTTAGAG 


1754 


CACCTGGGTG 


CCATCTGCAG 


ACAGTGATGT 


CTTACTTAAA 


ATGAACTTTA 


GATACAGCTG 


1814 


ACTGGTCGGC 


TATCCCTGGG 


GGTGATACCC 


TGGTAGCTTA 


TTCTAGTTAT 


GTTAAAGCTG 


1874 


GCAACAGTGA 


TAATTTAAGC 


TTCTATATTT 


ATGATCTTAC 


CAAGCTGGTG 


TGCTGTTAGT 


1934 
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CCTTCCCTCC TTGCCCTCAG TAATGCTAAG AGCTGCTGAC ATGTGGTATG ATAAGCACTG 1994 

TTACAACAAA ACCAACCTTT AAACCAAATG ACAGGCTAAA CACAGTGTCT GTCCTAGACT 2054 

AAGTCCAGAA ATCCGAGGTG TGCCCACGGT GTCCAGCTGG ACTGTGACCC AATGCCACAG 2114 

ACCTGGTGCT GTACCTTCAG CACTCAACAG GCACAGCTGT GTGTGTGACC TCAGAATGCT 2174 

GCTTATGTAT GTCCAGCCGG TTCTGAATGG AAGGTGTTAA ATGTTTTTGT TAAAAGTAAA 2234 

ACTTGTTTTT GCAAAAAAAA AGTCGAC 2261 

(2) INFORMATION FOR SEQ ID NO: 32: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 209 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32: 

Ala Pro Pro Gly Ser Arg His Lys Pro Val Pro Thr Lys Ser Thr Pro 
15 10 15 

Ala Gly Thr Glu Leu Gly Ser Thr Gly Ser Asp Gly Gly Asp Ala Val 
20 25 30 

Glu Tyr Asn Pro Asn Leu Leu Asp Asp Pro Gin Trp Pro Cys Gly Lys 
35 40 45 

His Lys Arg Val Leu He Phe Ala Ser Tyr Met Thr Thr Val He Glu 
50 55 60 

Tyr Val Lys Pro Ala Asp Leu Lys Lys Asp Met Asn Glu Thr Phe Arg 
65 70 75 80 

Glu Lys Phe Pro His He Lys Leu Thr Leu Ser Lys He Arg Ser Leu 
85 90 95 

Lys Arg Glu Met Arg Asn Leu Ser Glu Glu Cys Ser Leu Glu Pro Val 
100 105 HO 

Thr Val Ser Met Ala Tyr Val Tyr Phe Glu Lys Leu Ala Leu Gin Gly 
115 120 125 

Lys Leu Asn Lys Gin Asn Arg Lys Leu Cys Ala Gly Ala Cys Val Leu 
130 135 140 

Leu Ala Ala Lys He Ser Ser Asp Leu Arg Lys Ser Glu Val Lys Gin 
I 45 150 155 160 

Leu He Asp Lys Leu Glu Glu Arg Phe Arg Phe Asn Arg Lys Asp Leu 
165 170 175 

He Gly Phe Glu Phe Thr Val Leu Val Ala Leu Glu Leu Ala Leu Tyr 
180 185 190 

Leu Pro Glu Asn Gin Val Leu Pro His Tyr Arg Arg Leu Thr Gin Gin 
195 200 205 
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Phe 



(2) INFORMATION FOR SEQ ID NO: 33: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1538 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 



(vii) IMMEDIATE SOURCE: 
(B) CLONE: HW044 

(ix) FEATURE: 

(A) NAME/ KEY: CDS 

(B) LOCATION: 24.. 1100 

(ix) FEATURE: 

(A) NAME/ KEY: misc_f eature 

(B) LOCATION: complement (1161.. 1378) 

(D) OTHER INFORMATION: /label= SAC 24216 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33: 

GTCGACCTCC ATAGAGTTCC GGC ATG AAG CCA CCG CAG AGA CGG CGG AAG 50 

Met Lys Pro Pro Gin Arg Arg Arg Lys 
1 5 

GTC CCC GCA CGC TAC ACA GGT GAA GCT ACC GGC CCC ACA GCT TGG AGC 98 
Val Pro Ala Arg Tyr Thr Gly Glu Ala Thr Gly Pro Thr Ala Trp Ser 
10 15 20 25 

CCC CGC GAG ATG CGG CAC CTA CTG AGA CTA CTC CAG GCT CGG CGC GGC 146 
Pro Arg Glu Met Arg His Leu Leu Arg Leu Leu Gin Ala Arg Arg Gly 
30 35 40 

CAG CCG GAG CCA GAC GCC AAG GAG CTG GCC AAG GAG CTG CGA GGC CGG 194 
Gin Pro Glu Pro Asp Ala Lys Glu Leu Ala Lys Glu Leu Arg Gly Arg 
45 50 55 

AGC GAG GCC GAG ATC TGT CGC TTT ATC CAG CAG CTC AAA GGC CGG GTG 242 
Ser Glu Ala Glu lie Cys Arg Phe lie Gin Gin Leu Lys Gly Arg Val 
60 65 70 

GTT CGA GAG GCT ATT CGG AAG GTG CAA CCA GGT GGC AGA GAT GGT CCA 290 
Val Arg Glu Ala lie Arg Lys Val Gin Pro Gly Gly Arg Asp Gly Pro 
75 80 85 

AGG CAT CGA GAG ACA CAG CTC CCA GCC CCC ATA GAG GTA TGG ATG GAT 338 
Arg His Arg Glu Thr Gin Leu Pro Ala Pro lie Glu Val Trp Met Asp 
90 95 100 105 

CTT GCT GAG AAA TTA ACA GGC CCA CTG GAG GAA GCC CTG ACT GCA GCT 386 
Leu Ala Glu Lys Leu Thr Gly Pro Leu Glu Glu Ala Leu Thr Ala Ala 
110 115 120 
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TTC TCC CAG GTA CTC ACC ATT GCT GCT GCG GAA CCT CTC AGC CTC CTG 434 
Phe Ser Gin Val Leu Thr lie Ala Ala Ala Glu Pro Leu Ser Leu Leu 
125 130 135 

CAT TCC AAG CCA GGC AAG CCC ACA AAG GCC TGT GGA AGG GCA CTG GTG 482 
His Ser Lys Pro Gly Lys Pro Thr Lys Ala Cys Gly Arg Ala Leu Val 
140 145 150 

TTC TTG AGC ACC CAA GAT GCG CAG AAG GAT TCT GCC CCT GAA GGT TCT 530 
Phe Leu Ser Thr Gin Asp Ala Gin Lys Asp Ser Ala Pro Glu Gly Ser 
155 160 165 

GGG CCT GAA CCT ATG ACT GCT GCT GAC CCC ACT CCT GAG GCC TCT GTC 578 
Gly Pro Glu Pro Met Thr Ala Ala Asp Pro Thr Pro Glu Ala Ser Val 
170 175 180 185 

CCT GAA CCT AAG GCC TCC GGT ACC AAC CCT GAG ACC TCT GGC CTT GTC 626 
Pro Glu Pro Lys Ala Ser Gly Thr Asn Pro Glu Thr Ser Gly Leu Val 
190 195 200 

CCA GAG GTC TCT GTC CCT GAC CCT GAT GCA CCA ACT GAG TCC CTG GCT 674 
Pro Glu Val Ser Val Pro Asp Pro Asp Ala Pro Thr Glu Ser Leu Ala 
205 210 215 

GGA TCC TCC ACA GAG AGA GAC TTT GCC GTG GAC TTT GAG AAG ATC TAC 722 
Gly Ser Ser Thr Glu Arg Asp Phe Ala Val Asp Phe Glu Lys He Tyr 
220 225 230 

AAA TAC CTG TCC TCC TCC TCC AGA GGT GGC CAT GGC CCT GAG CTC TCA 770 
Lys Tyr Leu Ser Ser Ser Ser Arg Gly Gly His Gly Pro Glu Leu Ser 
235 240 245 

GCA GTT GAG TCC GCC GTG GTC CTT AAC CTG CTC ATG TCA CTT GCA GAG 818 
Ala Val Glu Ser Ala Val Val Leu Asn Leu Leu Met Ser Leu Ala Glu 
250 255 260 265 

GAA CTG CCC CAC CTG CCT TGC ACA GCC CTG GTG GAC CAT CTG ACT AAA 866 
Glu Leu Pro His Leu Pro Cys Thr Ala Leu Val Asp His Leu Thr Lys 
270 275 280 

ACA TAT GCT CAA CTG ACG GCT CCC CAG GCC TCT CTC GAT AGA GAA AAG 914 
Thr Tyr Ala Gin Leu Thr Ala Pro Gin Ala Ser Leu Asp Arg Glu Lys 
285 290 295 

AGG CCC AGG CCG GGG ACT GAA GAT GGA GGG ACT GAC TCC ACG GGG CCA 962 
Arg Pro Arg Pro Gly Thr Glu Asp Gly Gly Thr Asp Ser Thr Gly Pro 
300 305 310 

GAA GAG ACT GGC CAA GGC AGT CCT CAA GCT TCT GAG CCC ACA GAA CCG 1010 
Glu Glu Thr Gly Gin Gly Ser Pro Gin Ala Ser Glu Pro Thr Glu Pro 
315 320 325 

AGG CTG ACC TGG AAA GCA GCT GGG GTC TGC CCA CTG AAC CCA TTC CTG 1058 
Arg Leu Thr Trp Lys Ala Ala Gly Val Cys Pro Leu Asn Pro Phe Leu 
330 335 340 345 

GTG CCC CTG GAC CTC CTG AGC CAG GCC CTC GCC CCT GCC AGG 1100 
Val Pro Leu Asp Leu Leu Ser Gin Ala Leu Ala Pro Ala Arg 
350 355 

TGAAGGATGC AGCAGGCTGG CTATGCTGGT TTCTTTCTAC ATCGTCTGTC CAGCACTGGT 1160 

ACTGGTGTCG GGAACGTGAT TGTGGCTTCC TGACAGTGTC CGGCCATAGG CAAGCCCTTC 1220 
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AGACACTGCA TGCACTAGTA GCCTGGAAAG TGAAGCACAG TTGAAGGAAG GGGTTTAACT 1280 

ATTGGGGATT TGAGACTGCA CAAAGGCGGA ATTCCTGGGC TGGTAGATTG CGGCAAGTGT 1340 

CCCTGGCCCC GCTGTGCCTG GTTCTCTGCC ATAAGAGTAC CACATCATCA TCCCTTTGTG 1400 

CATGGCACTC GAACCCACCC CCAGCACACC CTCCCCACCT CAACTGACCA CTGTACCATC 1460 

CTCTTCGCCC TTCTCTGGGT GTCCTGTGAC CCTCAATAAA GCTGTCATCT AACTAGCTGT 1520 

GAAAAAAAAA AAGTCGAC 1538 

(2) INFORMATION FOR SEQ ID NO: 34: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 359 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34: 

Met Lys Pro Pro Gin Arg Arg Arg Lys Val Pro Ala Arg Tyr Thr Gly 
1 5 10 15 

Glu Ala Thr Gly Pro Thr Ala Trp Ser Pro Arg Glu Met Arg His Leu 
20 25 30 

Leu Arg Leu Leu Gin Ala Arg Arg Gly Gin Pro Glu Pro Asp Ala Lys 
35 40 45 

Glu Leu Ala Lys Glu Leu Arg Gly Arg Ser Glu Ala Glu He Cys Arg 
50 55 60 

Phe He Gin Gin Leu Lys Gly Arg Val Val Arg Glu Ala He Arg Lys 
65 70 75 80 

Val Gin Pro Gly Gly Arg Asp Gly Pro Arg His Arg Glu Thr Gin Leu 
85 90 " 95 

Pro Ala Pro He Glu Val Trp Met Asp Leu Ala Glu Lys Leu Thr Glv 
100 105 no 

Pro Leu Glu Glu Ala Leu Thr Ala Ala Phe Ser Gin Val Leu Thr He 
115 120 125 

Ala Ala Ala Glu Pro Leu Ser Leu Leu His Ser Lys Pro Gly Lys Pro 
130 135 140 

Thr Lys Ala Cys Gly Arg Ala Leu Val Phe Leu Ser Thr Gin Asp Ala 
145 150 155 160 

Gin Lys Asp Ser Ala Pro Glu Gly Ser Gly Pro Glu Pro Met Thr Ala 
165 170 175 

Ala Asp Pro Thr Pro Glu Ala Ser Val Pro Glu Pro Lys Ala Ser Glv 
180 185 190 

Thr Asn Pro Glu Thr Ser Gly Leu Val Pro Glu Val Ser Val Pro Asp 

200 205 
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Pro Asp Ala Pro Thr Glu Ser Leu Ala Gly Ser Ser Thr Glu Arg Asp 
210 215 220 

Phe Ala Val Asp Phe Glu Lys lie Tyr Lys Tyr Leu Ser Ser Ser Ser 
225 230 235 240 

Arg Gly Gly His Gly Pro Glu Leu Ser Ala Val Glu Ser Ala Val Val 
245 250 255 

Leu Asn Leu Leu Met Ser Leu Ala Glu Glu Leu Pro His Leu Pro Cys 
260 265 270 

Thr Ala Leu Val Asp His Leu Thr Lys Thr Tyr Ala Gin Leu Thr Ala 
275 280 285 

Pro Gin Ala Ser Leu Asp Arg Glu Lys Arg Pro Arg Pro Gly Thr Glu 
290 295 300 

Asp Gly Gly Thr Asp Ser Thr Gly Pro Glu Glu Thr Gly Gin Gly Ser 
305 310 315 320 

Pro Gin Ala Ser Glu Pro Thr Glu Pro Arg Leu Thr Trp Lys Ala Ala 
325 330 335 

Gly Val Cys Pro Leu Asn Pro Phe Leu Val Pro Leu Asp Leu Leu Ser 
340 345 350 

Gin Ala Leu Ala Pro Ala Arg 
355 

(2) INFORMATION FOR SEQ ID NO: 35: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2277 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 



(vii) IMMEDIATE SOURCE: 
(B) CLONE: HW045 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 9.. 1628 

(ix) FEATURE: 

(A) NAME/KEY: misc_f eature 

(B) LOCATION: 1973.. 2134 

(D) OTHER INFORMATION: /label= SAC_24028 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35: 

GTCGACCA GGG CGT GAT GGC TCC TAC TAC CAG CTC TTT GTT CAT GGT GGC 50 
Gly Arg Asp Gly Ser Tyr Tyr Gin Leu Phe Val His Gly Gly 
15 10 
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CGC CTC CAG CCA GTC CTA AGO CAG AAA GCC TGC CGA GGC ATG AAC TGG 98 
Arg Leu Gin Pro Val Leu Arg Gin Lys Ala Cys Arg Gly Met Asn Trp 
15 20 25 30 

ATA GCT GGG CTT CGG ATG GTG CCC GAT GGC AGT ATG GTC GTC TTG GGT 146 
lie Ala Gly Leu Arg Met Val Pro Asp Gly Ser Met Val Val Leu Gly 

35 40 45 • 

TTC CAT GCC AAC GAG TTT GTA GTG TGG AGC CCG CGG TCC CAT GAG AAG 194 
Phe His Ala Asn Glu Phe Val Val Trp Ser Pro Arg Ser His Glu Lvs 
50 55 60 

CTG CAC ATC GTC AAC TGC GGG GGA GGG CAC CGC TCC TGG GCC TTT TCT 242 
Leu His lie Val Asn Cys Gly Gly Gly His Arg Ser Trp Ala Phe Ser 
65 70 75 

GAT ACT GAG GCA GCC ATG GCC TTT GCC TAC CTT AAG GAT GGT GAT GTC 290 
Asp Thr Glu Ala Ala Met Ala Phe Ala Tyr Leu Lys Asp Gly Asp Val 
80 85 90 

ATG CTC TAC CGG GCT CTA GGA GGC TGC ATC AGG CCA AAC GTG ATT CTC 338 
Met Leu Tyr Arg Ala Leu Gly Gly Cys He Arg Pro Asn Val He Leu 
95 100 105 no 

CGG GAG GGT CTG CAT GGC CGG GAA ATC ACA TGT GTA AAG CGT GTG GGC 386 
Arg Glu Gly Leu His Gly Arg Glu He Thr Cys Val Lys Arg Val Gly 
"5 120 125 

££ T H T ? ™E C CTG 000 CCT GAA m GAG GTA CCC AAC TTG GAG CAT CCT 434 
Thr Val Thr Leu Gly Pro Glu Phe Glu Val Pro Asn Leu Glu His Pro 
130 135 140 

GAC TCC CTG GAG CCT GGC AGT GAG GGG CCT GGT CTG ATT GAC ATC GTG 482 
Asp Ser Leu Glu Pro Gly Ser Glu Gly Pro Gly Leu He Asp He Val 
145 150 155 

A ? A £5* 5? C AGT GAG GAC ACT ACT GTC TGT GTG CTA GCA CTT CCC ACC 530 
He Thr Gly Ser Glu Asp Thr Thr Val Cys Val Leu Ala Leu Pro Thr 
160 165 170 

ACC ACA GGC GCA GCC CAC GCC CTC ACT GCT GTC TGT AAC CAT ATC TCC 578 
Thr Thr Gly Ala Ala His Ala Leu Thr Ala Val Cys Asn His lie Ser 
175 180 185 190 

TCC GTG CGA GCC CTG GCA GTG TGG GGT GTT GGC ACC CCA GGT GGC CCA 626 
Ser Val Arg Ala Leu Ala Val Trp Gly Val Gly Thr Pro Gly Gly Pro 
195 200 205 

CAG GAT TCT CAC CCA GGG CTC ACG GCT CAG GTA GTG TCT GCA GGG GGT 674 
Gin Asp Ser His Pro Gly Leu Thr Ala Gin Val Val Ser Ala Gly Gly 
210 215 220 

CGA GCC GAG ATG CAC TGC TTC AGC CTC ATG ATC ACT CCG GAT GCC AGC 722 
Arg Ala Glu Met His Cys Phe Ser Leu Met He Thr Pro Asp Ala Ser 
225 230 235 

ACC CCA AGC CGC CTT GCC TGC CAC ATC ATG CAC CTT TCA TCC CAC CGG 770 
Thr Pro Ser Arg Leu Ala Cys His He Met His Leu Ser Ser His Arg 
240 245 250 

CTA GAT GAG TAC TGG GAC CGG CAG CGC AAC CGG CAC CGG ATG ATC AAG 818 
Leu Asp Glu Tyr Trp Asp Arg Gin Arg Asn Arg His Arg Met He Lys 
255 260 265 270 
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GTG GAC CCT GAG ACC AGG TAC ATG TCT CTT GCC ATT TGT GAG CTT GAC 866 
Val Asp Pro Glu Thr Arg Tyr Met Ser Leu Ala He Cys Glu Leu Asp 
275 280 285 

TCC GAT AGG CCT GGC CTT GGC CCC GGC CCC CTT GTG GCT GCA GCC TGT 914 
Ser Asp Arg Pro Gly Leu Gly Pro Gly Pro Leu Val Ala Ala Ala Cys 
290 295 300 

AGT GAC GGG GCA GTG AGG CTT TTT CTC TTG CAG GAC TCT GGG CGA ATT 962 
Ser Asp Gly Ala Val Arg Leu Phe Leu Leu Gin Asp Ser Gly Arg He 
305 310 315 

CTG CAT CTC CTT GCT GAG ACT TTC CAC CAT AAG CGG TGT GTC CTC AAG 1010 
Leu His Leu Leu Ala Glu Thr Phe His His Lys Arg Cys Val Leu Lys 
320 325 330 

GTC CAT TCG TTC ACA CAT GAG GCA CCC AAC CAG CGT CGG AGG CTG ATC 1058 
Val His Ser Phe Thr His Glu Ala Pro Asn Gin Arg Arg Arg Leu He 
335 340 345 350 

CTG TGC AGT GCA GCT ACA GAT GGC AGC ATA GCC TTC TGG GAT CTC ACC 1106 
Leu Cys Ser Ala Ala Thr Asp Gly Ser He Ala Phe Trp Asp Leu Thr 
355 360 365 

ACG GCA ATG GAC CGA GGC TCT ACT ACC CTG GAG CCT CCA GCA CAC CCT 1154 
Thr Ala Met Asp Arg Gly Ser Thr Thr Leu Glu Pro Pro Ala His Pro 
370 375 380 

GGG CTT CCC TAC CAG ATG GGC ACC CCC TGC CTG ACC GTC CAG GCC CAT 1202 
Gly Leu Pro Tyr Gin Met Gly Thr Pro Cys Leu Thr Val Gin Ala His 
385 390 395 

AGC TGT GGT GTC AAC AGC CTG CAC ACT TTG CCT ACA CCT GAG GGC CAT 1250 
Ser Cys Gly Val Asn Ser Leu His Thr Leu Pro Thr Pro Glu Gly His 
400 405 410 

CAT CTT GTG GCC AGT GGC AGT GAG GAC GGG TCC CTG CAT GTC TTC ACG 1298 
His Leu Val Ala Ser Gly Ser Glu Asp Gly Ser Leu His Val Phe Thr 
415 420 425 430 

CTT GCT GTG AAG ATG CCA GAG CTG GAA GAG GCT GAT GGG GAG GCT GAG 1346 
Leu Ala Val Lys Met Pro Glu Leu Glu Glu Ala Asp Gly Glu Ala Glu 
435 440 " 445 

TTG GTG CCC CAG TTA TGT GTC CTA GAT GAA TAT TCT GTC CCC TGT GCA 1394 
Leu Val Pro Gin Leu Cys Val Leu Asp Glu Tyr Ser Val Pro Cys Ala " " 

450 455 46Q 

CAT GCT GCC CAT GTG ACG GGC ATC AAG ATC CTA AGT CCC AAG CTG ATG 1442 
His Ala Ala His Val Thr Gly He Lys He Leu Ser Pro Lys Leu Met 
465 470 475 

GTC TCA GCC TCC ATA GAC CAA CGG CTG ACC TTC TGG CGT CTT GGA AAT 1490 

ffo Ala Ser Ile ^ Gln Leu Thr Phe Tr p Arg Leu Gly Asn 
480 485 490 

GGT GAG CCC ACC TTC ATG AAT AGC ACT GTG TAC CAC GTG CCA GAT GTG 1538 
Gly Glu Pro Thr Phe Met Asn Ser Thr Val Tyr His Val Pro Asp Val 
495 500 505 510 

GCT GAC ATG GAC TGC TGG CCT GTG AAC CCT GAG TTT GGC CAC CGC TGT 1586 
Ala Asp Met Asp Cys Trp Pro Val Asn Pro Glu Phe Gly His Arg Cys 
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515 520 525 

GCT CTT GCG GGT CAG GGA CTC GAG GTT TAC AAC TGG TAT GAC 1628 
Ala Leu Ala Gly Gin Gly Leu Glu Val Tyr Asn Trp Tyr Asp 
530 535 540 

TGAGTTACCA CAGTAGTTGG AGGACTGAGC CTGGGGCTTG ACCACAGACA GCGGAGCAGG 1688 

GATCAGCTGT CTGTGTCATG CTCTATGTGC TCTGAGGAGG TGAGGCAGTA CCATGGGTTC 1748 

CTGTAGGGTG TTGCATAGGT GGACCAAGAA TATGCCTCAC TCTTCACAAT AGGATGAAAC 1808 

TGTATTTATT CTGACTTTAA GTGCCCAACA TCTGTGAGGT TTCTGTGAGG TCTTGTTTTT 1868 

TTCCCAGTTG ATGCTTTTAT AAACATTCCC AGCTATTGGG CCCTTAGATG TGGCTCAGCG 1928 

GAGGGAGGCC CAGCATGGCC AAGCCTGTGT GGAACACCTC ACGTACTGCC CTCAAAAGCT 1988 

GTAGGCGAGC AAACATCTGA CCAAAGAGGT GTGGCCGAGG TTCCTGAAAG AAAAGCAACC 2048 

AGGCCCATCC TCATTTCCCG AGCCTGAGCC CTTGTCCATA TTTCCCACAG ACCCTCCCTT 2108 

GCTGTATGCT CACCCCTAGA ATGTGTACGC GGTTATAGTA TGAGCTGAAA TCCATGCTGA 2168 

GCTGCACCAG GAACTTGCAC ACCTAGAGAC AGAGGTTGGA TCGTTGAGCT GTTGTCTTTT 2228 

TTCTTGTGTC ACAACCCAGA ATAAAGAATA GTGTGTGTAG TGTCAAAAA 2277 

(2) INFORMATION FOR SEQ ID NO: 36: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 540 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36: 

Gly Arg Asp Gly Ser Tyr Tyr Gin Leu Phe Val His Gly Gly Arg Leu 
1 5 10 15 

Gin Pro Val Leu Arg Gin Lys Ala Cys Arg Gly Met Asn Trp He Ala 
20 25 30 

Gly Leu Arg Met Val Pro Asp Gly Ser Met Val Val Leu Gly Phe His 
35 40 45 

Ala Asn Glu Phe Val Val Trp Ser Pro Arg Ser His Glu Lys Leu His 
50 55 60 

lie Val Asn Cys Gly Gly Gly His Arg Ser Trp Ala Phe Ser Asp Thr 
65 ™ 75 80 

Glu Ala Ala Met Ala Phe Ala Tyr Leu Lys Asp Gly Asp Val Met Leu 
85 90 95 

Tyr Arg Ala Leu Gly Gly Cys He Arg Pro Asn Val He Leu Arg Glu 
100 105 no 

Gly Leu His Gly Arg Glu He Thr Cys Val Lys Arg Val Gly Thr Val 
115 120 125 
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Thr Leu Gly Pro Glu Phe Glu Val Pro Asn Leu 
130 135 



Glu His Pro Asp Ser 
140 



Leu Glu Pro Gly Ser Glu Gly Pro Gly Leu lie 
145 150 155 



Asp He Val He Thr 
160 



Gly Ser Glu Asp Thr Thr Val Cys Val Leu Ala 
165 170 

Gly Ala Ala His Ala Leu Thr Ala Val Cys Asn 
180 185 

Arg Ala Leu Ala Val Trp Gly Val Gly Thr Pro 
195 200 



Leu Pro Thr Thr Thr 
175 

His He Ser Ser Val 
190 

Gly Gly Pro Gin Asp 
205 



Ser His Pro Gly Leu Thr Ala Gin Val Val Ser 
210 215 



Ala Gly Gly Arg Ala 
220 



Glu Met His Cys Phe Ser Leu Met He Thr Pro 
225 230 235 

Ser Arg Leu Ala Cys His He Met His Leu Ser 
245 250 

Glu Tyr Trp Asp Arg Gin Arg Asn Arg His Arg 
260 265 

Pro Glu Thr Arg Tyr Met Ser Leu Ala He Cys 
275 280 



Asp Ala Ser Thr Pro 
240 

Ser His Arg Leu Asp 
255 

Met He Lys Val Asp 
270 

Glu Leu Asp Ser Asp 
285 



Arg Pro Gly Leu Gly Pro Gly Pro Leu Val Ala 
290 295 

Gly Ala Val Arg Leu Phe Leu Leu Gin Asp Ser 
305 310 315 

Leu Leu Ala Glu Thr Phe His His Lys Arg Cys 
325 330 

Ser Phe Thr His Glu Ala Pro Asn Gin Arg Arg 
340 345 

Ser Ala Ala Thr Asp Gly Ser He Ala Phe Trp 
355 360 



Ala Ala Cys Ser Asp 
300 

Gly Arg He Leu His 
320 

Val Leu Lys Val His 
335 

Arg Leu He Leu Cys 
350 

Asp Leu Thr Thr Ala 
365 



Met Asp Arg Gly Ser Thr Thr Leu Glu Pro Pro 
370 375 

Pro Tyr Gin Met Gly Thr Pro Cys Leu Thr Val 
385 390 395 

Gly Val Asn Ser Leu His Thr Leu Pro Thr Pro 
405 410 



Ala His Pro Gly Leu 
380 

Gin Ala His Ser Cys 
400 

Glu Gly His His Leu 
415 



Val Ala Ser Gly Ser Glu Asp Gly Ser Leu His 
420 425 

Val Lys Met Pro Glu Leu Glu Glu Ala Asp Gly 
435 440 

Pro Gin Leu Cys Val Leu Asp Glu Tyr Ser Val 
450 455 



Val Phe Thr Leu Ala 
430 

Glu Ala Glu Leu Val 
445 

Pro Cys Ala His Ala 
460 
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Ala His Val Thr Gly lie Lys He Leu Ser Pro Lys Leu Met Val Ser 
465 470 475 480 

Ala Ser He Asp Gin Arg Leu Thr Phe Trp Arg Leu Gly Asn Gly Glu 
485 490 495 

Pro Thr Phe Met Asn Ser Thr Val Tyr His Val Pro Asp Val Ala Asp 
500 505 510 

Met Asp Cys Trp Pro Val Asn Pro Glu Phe Gly His Arg Cys Ala Leu 
515 520 525 

Ala Gly Gin Gly Leu Glu Val Tyr Asn Trp Tyr Asp 
530 535 540 

(2) INFORMATION FOR SEQ ID NO: 37: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2373 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 

(vii) IMMEDIATE SOURCE: 
(B) CLONE: HW046 

(ix) FEATURE: 

(A) NAME/ KEY: CDS 

(B) LOCATION: 9.. 1160 

(ix) FEATURE: 

(A) NAME/KEY: misc_f eature 

(B) LOCATION: complement (1523.. 1760) 
(D) OTHER INFORMATION: /label= SAC_24036 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37: 

GTCGACGT GAC TTC CTT GTC AGG TTC ACC CCC GCT GCA GCA GGG CTC TGC 50 
Asp Phe Leu Val Arg Phe Thr Pro Ala Ala Ala Gly Leu Cys 
15 10 

TCT CTG ACT GAG TGG TGC TGT GTT ACA TGG ATC TGT CAT TTC TAT TCA 98 
Ser Leu Thr Glu Trp Cys Cys Val Thr Trp He Cys His Phe Tyr Ser 
15 20 25 30 

GTC ATC AGT TGG CAT TGC AGT AAC ACC TAT GGC AAT GAA GAA AGA GAC 146 
Val He Ser Trp His Cys Ser Asn Thr Tyr Gly Asn Glu Glu Arg Asp 
35 40 45 

AGA ATT TGG GAT TTT CAT CTA AGG AAA ATG ATA AAT AAC AAC CAG ACG 194 
Arg He Trp Asp Phe His Leu Arg Lys Met He Asn Asn Asn Gin Thr 
50 55 60 

TGT GCT GCA GGG CAG GAC TCC GTG CCC TAC GTG ACC TGT ATG CTT CAC 242 
Cys Ala Ala Gly Gin Asp Ser Val Pro Tyr Val Thr Cys Met Leu His 
65 70 75 
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GTG CTG GAA GAG TGG TTG GGT GTG GAA CAG TTG GAG GAC TAC CTG AAT 290 
Val Leu Glu Glu Trp Leu Gly Val Glu Gin Leu Glu Asp Tyr Leu Asn 
80 85 90 

TTT GCG AAC CAT CTC TTG TGG GTC TTC ACC CCG CTG ATC CTT TTA ATA 338 
Phe Ala Asn His Leu Leu Trp Val Phe Thr Pro Leu lie Leu Leu lie 
95 100 105 110 

CTT CCG TAC TTT ACC ATC TTC CTT CTC TAC CTT ACT ATT ATT TTC CTC 386 
Leu Pro Tyr Phe Thr He Phe Leu Leu Tyr Leu Thr He He Phe Leu 
115 120 125 

CAC ATC TAT AAG AGG AAG AAT GTG TTA AAA GAA GCC TAC TCT CAC AAC 434 
His He Tyr Lys Arg Lys Asn Val Leu Lys Glu Ala Tyr Ser His Asn 
130 135 140 

TTG TGG GAT GGC GCG AGG AAA ACA GTG GCA ACC CTG TGG GAT GGA CAT 482 
Leu Trp Asp Gly Ala Arg Lys Thr Val Ala Thr Leu Trp Asp Gly His 
145 150 155 

GCG GCG GTT TGG CAT GGT TAT GAA GTT CAC GGG ATG GAA AAG ATA CCA 530 
Ala Ala Val Trp His Gly Tyr Glu Val His Gly Met Glu Lys He Pro 
160 165 170 

GAA GGA CCA GCA CTT ATA ATT TTT TAT CAT GGA GCT ATT CCC ATA GAC 578 
Glu Gly Pro Ala Leu He He Phe Tyr His Gly Ala He Pro He Asp 
175 180 185 190 

TTT TAC TAC TTC ATG GCT AAA ATT TTC ATC CAC AAA GGC AGA ACT TGC 626 
Phe Tyr Tyr Phe Met Ala Lys He Phe He His Lys Gly Arg Thr Cys 
195 200 205 

CGA GTG GTA GCT GAC CAC TTT GTC TTT AAA ATC CCA GGG TTC AGT TTA 674 
Arg Val Val Ala Asp His Phe Val Phe Lys He Pro Gly Phe Ser Leu 
210 215 220 

TTA CTT GAT GTA TTT TGT GCT CTT CAT GGA CCA AGA GAA AAA TGT GTT 722 
Leu Leu Asp Val Phe Cys Ala Leu His Gly Pro Arg Glu Lys Cys Val 
225 230 235 

GAA ATC TTG AGG AGT GGT CAC TTG TTA GCT ATT TCA CCG GGT GGA GTT 770 
Glu He Leu Arg Ser Gly His Leu Leu Ala He Ser Pro Gly Gly Val 
240 245 250 

CGA GAA GCC TTA CTT AGT GAT GAA ACC TAC AAC ATC ATA TGG GGT AAT 818 
Arg Glu Ala Leu Leu Ser Asp Glu Thr Tyr Asn He He Trp Gly Asn 
255 260 265 270 

CGT AAA GGC TTT GCT CAG GTT GCA ATC GAT GCA AAA GTG CCC ATT ATT 866 
Arg Lys Gly Phe Ala Gin Val Ala He Asp Ala Lys Val Pro He He 
275 280 285 

CCT ATG TTT ACA CAA AAC ATC CGA GAA GGA TTT AGA TCA CTC GGA GGA 914 
Pro Met Phe Thr Gin Asn He Arg Glu Gly Phe Arg Ser Leu Gly Gly 
290 295 300 

ACA AGA TTG TTT AAA TGG CTT TAT GAA AAA TTC CGC TAT CCG TTT GCT 962 
Thr Arg Leu Phe Lys Trp Leu Tyr Glu Lys Phe Arg Tyr Pro Phe Ala 
305 310 315 

CCA ATG TAT GGA GGT TTT CCT GTG AAG TTG CGG ACC TTC TTG GGT GAT 1010 
Pro Met Tyr Gly Gly Phe Pro Val Lys Leu Arg Thr Phe Leu Gly Asp 
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320 325 330 

CCT ATT CCG TAT GAC CCA GAG GTA ACA GCA GAA GAA TTA GCT GAA AAG 1058 
Pro lie Pro Tyr Asp Pro Glu Val Thr Ala Glu Glu Leu Ala Glu Lys 
335 340 345 350 

ACT AAG AAC GCT GTT CAA GCT TTG ATC GAC AAG CAC CAA AGG ATA CCG 1106 
Thr Lys Asn Ala Val Gin Ala Leu lie Asp Lys His Gin Arg He Pro 
355 360 365 

GGG AAC ATT AGG AGT GCT TTG CTG GAC CGC TTT CAT AAA GAG CAG AAG 1154 
Gly Asn He Arg Ser Ala Leu Leu Asp Arg Phe His Lys Glu Gin Lys 
370 375 380 

GCT AAT TGACAGATAA TTTCGTGTGT CTCTAATGCC AAGTTTGCAT CTGTGGTACT 1210 
Ala Asn 



GCCTTTTAAA 


CTTTTGTAGG 


TTGTATAATA 


GTCTTTTTAA 


AACCTGCTAA TGAGCATCTT 


1270 


TCTTTAGAAT 


TTAAACTTTC 


TATGGTTAAT 


CTTACCCTTT 


CAATCAGTTT TGTCAAATGT 


1330 


AGTAAGTAAC 


TCACCATTTG 


CTTGACTCAG 


AAGATGATCA 


TGTTGCATTT GCAATCCTTA 


1390 


GTAGTATATG 


ATAGGCACAA 


GTCTTCCTGG 


ATAACAGCAG 


TCTAGGTTAG GCAGACTTCA 


1450 


CTAAATTATG 


CTTCTGTGTT 


TGTGTCTGTT 


GTGCCAATGG 


GATGGTGAGA GGTACCTGAG 


1510 


CTGGGGTGTA 


GCACTTGCTT 


AGTTTAGGTG 


AGCGGTGCCT 


TTGTATCCAT CCf TCATTCC 


1570 


ACTCACAACT 


GCTGGTAAGT 


TGTGGAACGC 


TTTTGCACCT 


CCGTGGTTTA TGTTAAGCAT 


1630 


GGTAATACTG 


TGGTGGTGAA 


CATTAAGTTG 


CTTATGTCTA 


CTGTGATATA TCCGTATCTA 


1690 


TATATCTGAT 


AGATATCTGT 


AGTGAGAAAT 


CTGTAATTGA ATTGATTCCT GAATCCTTTA 


1750 


GGAGTTATGT 


ACATTTTTGT 


CCATTGGGTA 


TTGAAATTAC 


TCCTTCAGGG ATGATGTCCA 


1810 


GTTGCCAAGC 


ACAATGAAAG 


AGATGTACTA 


TTTTGTAAAT 


CAAATCATTG TTACTTGTAG 


1870 


TATTTATTAA 


CAGAGTTGTT 


TGGTATTTAT 


TTTAAACAAG 


GATCTAAGTA AATGTATAGG 


1930 


ATGGATTGTC 


AGCACGTGTC 


TTACACGGTG 


TAAAGATAAA 


ATCGTGGGCA CGCTCTGCAG 


1990 


TTTATGTTAG 


AGCAGATCCT 


AGGCTGAGGA 


GTGACAACTG 


AGGAAGCTCT GCCTCGCTTA 


2050 


TCCCAGGGCT 


CGCTTCCGTG 


CTCTGCAGCC 


AGTCACGTGG 


TTATATCTGT AAAATGTACT 


2110 


TGAATGATGA 


ATTTCTGTAA 


ACATGGAGTC 


CTCTGAAAAT 


GTAATTACTT TGAATTGTTG 


2170 


TCTTCTTGAA 


AACTTATTTT 


AAAAGTATTT 


AAACAGCCAA GTATCATGAA TTCATCGCAG 


2230 


TGTGTCTTCC 


CCCTTTGCAG 


TACTTGGGTT 


TTATATGGAC 


TTTTTAACTA CAAGTGTATA 


2290 


TTTGTGATAT 


GTATAATGTA 


TGTATAATGT 


ATATAGAAAT GGTGTGAAAA TAAAGCTGAA 


2350 


CTGCAACTCC 


TCTCTAGTAA 


AAA 






2373 



(2) INFORMATION FOR SEQ ID NO: 38: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 384 amino acids 
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(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 38: 

Asp Phe Leu Val Arg Phe Thr Pro Ala Ala Ala Gly Leu Cys Ser Leu 
1 5 10 15 

Thr Glu Trp Cys Cys Val Thr Trp He Cys His Phe Tyr Ser Val He 
20 25 " 30 

Ser Trp His Cys Ser Asn Thr Tyr Gly Asn Glu Glu Arg Asp Arg He 
35 40 45 

Trp Asp Phe His Leu Arg Lys Met He Asn Asn Asn Gin Thr Cys Ala 
50 55 60 

Ala Gly Gin Asp Ser Val Pro Tyr Val Thr Cys Met Leu His Val Leu 
65 70 75 80 

Glu Glu Trp Leu Gly Val Glu Gin Leu Glu Asp Tyr Leu Asn Phe Ala 
85 90 95 

Asn His Leu Leu Trp Val Phe Thr Pro Leu He Leu Leu He Leu Pro 
100 105 110 

Tyr Phe Thr He Phe Leu Leu Tyr Leu Thr He He Phe Leu His He 
115 120 125 

Tyr Lys Arg Lys Asn Val Leu Lys Glu Ala Tyr Ser His Asn Leu Trp 
130 135 140 

Asp Gly Ala Arg Lys Thr Val Ala Thr Leu Trp Asp Gly His Ala Ala 
145 150 155 160 

Val Trp His Gly Tyr Glu Val His Gly Met Glu Lys He Pro Glu Gly 
165 170 175 

Pro Ala Leu He He Phe Tyr His Gly Ala He Pro lie Asp Phe Tyr 
180 185 190 

Tyr Phe Met Ala Lys He Phe He His Lys Gly Arg Thr Cys Arg Val 
195 200 205 

Val Ala Asp His Phe Val Phe Lys He Pro Gly Phe Ser Leu Leu Leu 
210 215 220 

Asp Val Phe Cys Ala Leu His Gly Pro Arg Glu Lys Cys Val Glu He 
225 230 235 240 

Leu Arg Ser Gly His Leu Leu Ala He Ser Pro Gly Gly Val Arg Glu 
245 250 255 

Ala Leu Leu Ser Asp Glu Thr Tyr Asn He He Trp Gly Asn Arg Lys 
260 265 270 

Gly Phe Ala Gin Val Ala He Asp Ala Lys Val Pro He He Pro Met 
275 280 285 

Phe Thr Gin Asn He Arg Glu Gly Phe Arg Ser Leu Gly Gly Thr Arg 
290 295 300 
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Leu Phe Lys Trp Leu Tyr Glu Lys Phe Arg Tyr Pro Phe Ala Pro Met 
305 310 315 320 

Tyr Gly Gly Phe Pro Val Lys Leu Arg Thr Phe Leu Gly Asp Pro lie 
325 330 335 

Pro Tyr Asp Pro Glu Val Thr Ala Glu Glu Leu Ala Glu Lys Thr Lys 
340 345 350 

Asn Ala Val Gin Ala Leu lie Asp Lys His Gin Arg lie Pro Gly Asn 
355 360 365 

He Arg Ser Ala Leu Leu Asp Arg Phe His Lys Glu Gin Lys Ala Asn 
370 375 380 

(2) INFORMATION FOR SEQ ID NO: 39: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2241 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 



(vii) IMMEDIATE SOURCE: 
(B) CLONE: HW047 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 8 . . 511 

(ix) FEATURE: 

(A) NAME/KEY: misc_f eature 

(B) LOCATION: complement (1675.. 1833) 
(D) OTHER INFORMATION: /label= SAC_23915 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 39: 

GTCGACC AGG GCC TTC CAT CAG ACC TGC CCA AAC TTT GTC ATA GAG AAG 49 
Arg Ala Phe His Gin Thr Cys Pro Asn Phe Val He Glu Lys 
15 10 

ATT GAA AGG ATT CAA AAT CCA GCT CTT TGG AGG AGG TAC CAG GCG TAC 97 
He Glu Arg He Gin Asn Pro Ala Leu Trp Arg Arg Tyr Gin Ala Tyr 
15 20 25 30 

AAA AAA ATT ATG GAT GAA AAG AAT GGC AAT GTG ATA AAT GAG AAG CAA 145 
Lys Lys He Met Asp Glu Lys Asn Gly Asn Val He Asn Glu Lys Gin 
35 40 45 

CTC TTC CAT GGG ACG GAG TTT GGC TCT CTG GCA CAG CTC AAC AGT AAT 193 
Leu Phe His Gly Thr Glu Phe Gly Ser Leu Ala Gin Leu Asn Ser Asn 
50 55 60 

GGA TTT AAC CGC AGC TAT GCT GGG AAG AAC GCT ACG GCT TAT GGG AAA 241 
Gly Phe Asn Arg Ser Tyr Ala Gly Lys Asn Ala Thr Ala Tyr Gly Lys 
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65 70 75 

GGA ACC TAT TTT GCA GTC AAT GCT TCC TAT TCT GCC CAT GAC ACA TAT 289 
Gly Thr Tyr Phe Ala Val Asn Ala Ser Tyr Ser Ala His Asp Thr Tyr 
80 85 90 

TCC AAA CCA GAT GCC AAT GGG AAA AAG TAT ATG TAT TAT GTG CGG GTC 337 
Ser Lys Pro Asp Ala Asn Gly Lys Lys Tyr Met Tyr Tyr Val Arg Val 
95 100 105 110 

CTC ACT GGA AAC TAC ACA CAA GGA AAC GCA TCA CTG ATT GTG CCT CCT 385 
Leu Thr Gly Asn Tyr Thr Gin Gly Asn Ala Ser Leu lie Val Pro Pro 
115 120 125 

TCA AGG GAC CCT CAA AAT GCT ACT GAC TTG TTT GAC ACT GTC GCA GAT 433 
Ser Arg Asp Pro Gin Asn Ala Thr Asp Leu Phe Asp Thr Val Ala Asp 
130 135 140 

AAT GTT ATA AAT CCG AGT ATA TTT GTA GTG TTT TAT GAC AAT CAA GCA 481 
Asn Val lie Asn Pro Ser lie Phe Val Val Phe Tyr Asp Asn Gin Ala 
145 150 155 



TAC CCT GAG TAT CTT ATC ACA TTT AGG AAG TAACACTTTG GGGTGCCTTC 
Tyr Pro Glu Tyr Leu lie Thr Phe Arg Lys 
160 165 


531 


TGAAAGACAT 


TGGTATCTGC 


CGTAGGACAA ATTTTAGCTG 


TTTTCCTTCT 


TAGACTTTCT 


591 


TCTCTAAGAG 


CACGAGGTCT 


TTCTCTGCCA 


CGGATTGTTT 


CCTCAGCTGC 


CCCTTCATAG 


651 


TGAACATGAA 


CCTACTTGAG 


GGCCAACCAC .TTGAGAATTT 


CAAGCAGGTG 


ACAGCTTTAG 


711 


CTGCCTTCAC 


AAATGTGAAC 


CTCAAATCCA 


CGGAAAAGAA 


CAAACTTACG 


TTCTGGAAGG 


771 


ACAATAAAAC 


CTTTATGCAC 


TTTCAGAAAG 


CCCACAATAC 


ACCCACCACC 


AGTGGAGTTA 


831 


CATCAGAGCA 


CCACTGTGCT 


GAGGGGCACC 


CAGGACCACT 


GAGCCAGTAA 


GGCTGCCAAC 


891 


TGAACAGATT 


CCAAGACAGA 


GAACTTCGTT 


CATCACTAAC 


AGCGCCGTGT 


CCTGTGGGTT 


951 


GGACAGTTCC 


AGCCAGGGTT 


ATGTACAGAC 


TGGGTTCTGG 


GATGTGGCTC 


TGGGTCTGTG 


1011 


TGAACTGAGC 


CCCCCAAGGA 


GAGGCACAGC 


TGAAGCCCTG 


TCAGGGGGCC 


CTTTTAATGT 


1071 


TGACACCAAT 


GTTGAGGCTA 


CCTAGTGAAG 


AAATGGGATG 


GGGCCTGCTA 


CCTCTCCTGT 


1131 


GGAAGTCCTT 


CTACTGGGTG 


AAAATGAATG 


AACTCTAACA 


GTATTGGCTA 


GCGTCTTTCT 


1191 


TGTGGCAGGC 


TGGCTTGTGA 


TAACTTGAAG 


CTTTATGACT 


AAGTCAGTCA 


TTTTGCTAAC 


1251 


TCACTAACAG 


TATCTACTGG 


ATGCGATTTG 


TTTCTTGTGA 


TGCCAGAGAT 


CACCCAGAGC 


1311 


CTCACATGGG 


CCAGGCAAGC 


ACTCTACTAC 


TGAACTATAT 


TCCCAGCCTC 


TTAGTGAGGC 


1371 


AAGACTCCAC 


TAAGTTGCCC 


TGGCAGGTTT 


GAACTCATTA 


TCTTCCTGCA 


TTGGCTACTT 


1431 


ATTGGGGTGG 


GGTGGGGTGC 


TTAAATTGTT 


CTTTTAATTT 


TCTCCTGTAC 


TCCCATTTAT 


1491 


TGAGTGCCTG 


GACCCTTGTC 


AGGTAGAATA 


AGAGGTGTGT 


TCACATAGAT 


AATTTTGTTT 


1551 


CCATAAAAAA 


AAATGCACTG 


TAAGAATTAT 


GTCAAGGCCT 


AGGAGCAGGC 


TCAGAGTACA 


1611 


AGAGCTGTCC 


TTGCCAAAGG 


TCACTAAGGC 


AGGAACCGCA 


GAGCCAGCCA 


CCAGAGTCTT 


1671 
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GGTACTGGGT 


TGAGCCCCTC 


ACTATTCCCA GCACAGGGCC 


TGGCCCTGAG GTAAGCCTGT 


1731 


GCTGCTTCAG 


AAGTTGGCTT 


TTCTCCTTCT GGGATTGGTG 


TCCACTCTGC CCCTTTTCTA 


1791 


GGTCATGGAG 


CTAGTGCGGT 


GTAATGTTCC CAGGAGGTAG 


GTACACTCTT AGGTGTTGTG 


1851 


GGGATCCAAG 


TGCTGGTTCC 


CACCATTGGC ATTAACAGTG 


GACTGCAAGC AGAGAGAGGG 


1911 


GTCCCCATCA 


TTGTTTCGTG 


GACACTGAGG CCTTGAGACC 


TTGGCTGCTG TGTCTATGTG 


1971 


GACTGGCAAA 


GACTGTGGCT 


TGGGAGGGGC TTCCTCAGCA 


CTGGCACCTG AACTCAGCAC 


2031 


CAAGGATGGC 


CCCAAGAGGC 


TTGTTTTCAT TTGCCTTTGT 


TCTCCAGTGC TTTGGTTACC 


2091 


TCAGGGCAGG 


GCTGAACAAT 


AGTGTGATGT TACATTGAAG AATAAAGACC CAACCAACAA 


2151 


ATGTTAGTGT 


CATTTTCAGT 


CTAAGTAAAT AAGCCATTAT GTCATTTCCC TTACTTCCTA 


2211 


GTAAAGTTAT 


GTAATTACAT 


AATGTAAAAA 




2241 



(2) INFORMATION FOR SEQ ID NO: 40: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 168 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

Ui) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 40: 

Arg Ala Phe His Gin Thr Cys Pro Asn Phe Val He Glu Lys He Glu 
15 10 15 

Arg He Gin Asn Pro Ala Leu Trp Arg Arg Tyr Gin Ala Tyr Lys Lys 
20 25 30 

He Met Asp Glu Lys Asn Gly Asn Val He Asn Glu Lys Gin Leu Phe 
35 40 45 

His Gly Thr Glu Phe Gly Ser Leu Ala Gin Leu Asn Ser Asn Gly Phe 
50 55 60 

Asn Arg Ser Tyr Ala Gly Lys Asn Ala Thr Ala Tyr Gly Lys Gly Thr 
65 70 75 80 

Tyr Phe Ala Val Asn Ala Ser Tyr Ser Ala His Asp Thr Tyr Ser Lys 
85 90 ' 95 

Pro Asp Ala Asn Gly Lys Lys Tyr Met Tyr Tyr Val Arg Val Leu Thr 
100 105 110 

Gly Asn Tyr Thr Gin Gly Asn Ala Ser Leu He Val Pro Pro Ser Arg 
115 120 125 

Asp Pro Gin Asn Ala Thr Asp Leu Phe Asp Thr Val Ala Asp Asn Val 
130 135 140 

He Asn Pro Ser He Phe Val Val Phe Tyr Asp Asn Gin Ala Tyr Pro 
145 150 155 160 
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Glu Tyr Leu lie Thr Phe Arg Lys 
165 

(2) INFORMATION FOR SEQ ID NO: 41: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1918 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 



<vii) IMMEDIATE SOURCE: 
<B) CLONE: HW049 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION: complement (1289.. 1612) 
(D) OTHER INFORMATION: /label= SAC_24337 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 41: 



GTCGACGTGG 


TGATTATCTT 


AAAGGTAAGG 


GAGGTTTTTT 


TTTCCCTGAT 


ACTGATGAAG 


60 


GCTGAGTGTG 


ACTGTTTAGT 


ATAGCTAGTG 


GGCTTTGTCA 


CTTAAGCCTT 


CCTTAGCATG 


120 


TTCCCTTTCC 


CTTATTTTTC 


CTGGATACTT 


ACGTGCTAAC 


GTGCTAACAA 


CTCCAGATCA 


180 


AACTGAGTAT 


GGAGATTTTT 


GCTTTAACAA 


TTTAGGAGTG 


GCAAACAATT 


TATAACTGGC 


240 


AGGGTTTGAG 


GACAGCCGTG 


TTGAAAAACC 


AAAAGAAAAT 


GCTAAGGCAG 


GTCTTGACCA 


300 


ATGTGGATAC 


TTTGACTAAT 


TAACTTCGTT 


ACTTTAGGAG 


GAGGTTGTTC 


CTGCACGCCA 


360 


TCCATCTCCA 


GCGTTCGAAG 


GTTAGATTGT 


TGGGGTGGTG 


TGTCTTGTTC 


ACCAGATTGG 


420 




GATGAATCAA 


CTAGCTCACT 


ATGACTGCTA 


ATGAAAACAC 


TGGAACACCT 


480 


GAGAAACTGA 


AGGGTGGCGT 


GTTGGGGCAA 


GGGTGCTTAC 


ATATCAAGCC 


TTTTCAATAG 


540 


TTTGTTTTTC 


TCCAGGTTCT 


GTTAGCAGTG 


CTGTTACTTG 


GTGCTTCAGA 


GGTAAGTAAT 


600 


TGCAGCCAGC 


GTGCAGTGGT 


GTGCACACTG 


CTGGACTCAG 


GTCCTAAGTA 


CTTCATATAT 


660 


ATGATGACTT 


TCATAGAATC 


TCGTTCGGCT 


GATGATTGCT 


GTTGAGACTT 


GGAAATCTGA 


720 


TTTTTCTGAG 


GATGATGGGT 


TGGATGGGAA 


CAATCTGCCA 


GGATTTTTAA 


CATTGCCCAT 


780 


CTCGTGTGTT 


TCAGATATGG 


TGACTGGCCG 


AGCGCAAAGG 


AGCCATGCCA 


GGTAAATGTT 


840 


CTAGCAATAT 


TATTGTTTAT 


TAGGTAGATA 


CGTAATGGCT 


TGATCCAGTG 


CTTTCCTGCT 


900 


CACCCTGATG 


AACTGAATAC 


CGCCCCAGTC 


TGATAGCTGT 


GGAGAAAGGT 


ATTTTCTGAG 


960 


TTGAGCTCAT 


CTTGGATCAA 


ACAGGAAGGG 


TTTTATGCTA 


GTTAGTATAT 


GATCTCTGTA 


1020 
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PJPJ AT A P"PiTTT 
OOnX AUjI 1 1 


APiPiP A ATA APi 


PiATAATP4APP 
On x nn lunvL 


P" A A A'i"t J l l, A*i"i* 
LnnnX X XnX X 


X nv, Xo 1X1 Xo 


c a pip* a a a a 
LAOoAAAAu 1 


i Aon 


A HdHfir* hp AT 
nn OnOVn V— n X 




TPP AP4P4PT A A 
X N^v^noov. inn 


AP4T A AP4P4TP A 
no x nnuu x on 


P. A A A APT A API 


ATPTA A apTP 
n 1 o 1 Annu lO 


1 1 ah 

114U 


InULn/iui X n 


p.t a apttppp 


TATP.AAP.AA A 


PATTPAPAPPi 


XV X InL 1 1V.L 


TP*TPV^T A P*P"T* 
XO 1\, v» 1 AbC 1 


1 ^PiPi 

izuu 


wV^nOnOW. iu 


AAAAP4P4TP4AA 
nnnnVJU XOnn 


ppp a c^rctrztzn, 

^unu ±oooo 


PTYiPiPTHPiPiPi 
V* 1MVJV. XOOOO 


Pi A A A API A PIP! A 
OnnnnOnOOn 


nnX 1 1 lol IV 




PAfiA Af^PiAAP 

V-~nOnnOOnnV» 


x o x v. i\xnuv3v 


ni\Jll X XOXn 


TP.TP.TPJPi A A A 
XO X o luVJAnn 


fZPP* A A fZ A A TT 
OOonnonnl 1 


A A APTTTP* AT* 
AAAv. 1 1 xoAi 


1 ^0 Pi 


TA A A A APPiA A 


APAA^TfinT 
AwiAVaVj xoo x 


X X O IV. Xo XXX 


X iVnvu 1 V- x o 


RPPXPAPPPi 
nOoAoAOOOA 


loAooToToT 


1 1 QPt 


OX InVViUUlA 


PiP/PAPiPPPTT 


PAifiTPATTfi 


XnnX InOoio 


OOA- AOOO 1 no 


ATTTTGCTTT 


144U 


PTPiPTA ATTA 
C I vjV— 1 nn 1 In 


A A 1PTTTT1P 
nnn^-. XXX InL 


P'PTAPAfifilAA 
LI 1 nwiuunn 


PTPi A A A TP 1 A A 


A P* TV TlTT'P ?\ mrn 


AoCTAooCCA 


1 con 


O iuo L~nl* UOL 


A A A P*TTP A A 


nL i\.n luviAA 


x AuAbLLTGT 


TGATTCC TTG 


TGGATTTTTA 


1560 


PiP A TP 1¥2 A PiP 


A PIT A T P A PiTPi 


P2PAPTPP A IP 


lurtM XOO X 1 


P* T\ HV*FT*P"TV1V"^ TV 


oTACCGCTGG 


1620 


A6CA6ATTAA 


TATTTATTTC 


ACTGCCAGAG 


GCTATATTTC 


TCTGCCCCCC 


CATACCCCCT 


1680 


TCCTCTTCTC 


CTTAGACTAC 


TTAAGATTGA 


AATATTTGCA 


ACTCCAAATT 


TATCTTCCTT 


1740 


GCTTACCACT 


TTTTTTTCCG 


AGCTGGGGAC 


CAAACCCAGG 


GCCTTGCGCT 


TCCTAGGTAA 


1800 


GCGCTCTATC 


ACTGAGCTAA 


ATCCCCAGCC 


CCGCTTACCA 


CATTGTTAAT 


CTGATCCTTC 


1860 


CGCAGTTATT 


TGTGCTGTTC 


TTTTTGTTCC 


TGGCAATAAA 


TGCTTTTTAT 


GCTAAAAA 


1918 



(2) INFORMATION FOR SEQ ID NO: 42: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3942 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 



(vii) IMMEDIATE SOURCE: 
(B) CLONE: HW050 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 7.. 1326 

(ix) FEATURE: 

(A) NAME/KEY: misc_f eature 

(B) LOCATION: complement (3542.. 3733) 
(D) OTHER INFORMATION: /label= SAC_24644 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 42: 

GTCGAC ATA AAT ATG GCC GTC CGG GTA CCC CTC GGC TGC ACG GGC TCC 48 
He Asn Met Ala Val Arg Val Pro Leu Gly Cys Thr Gly Ser 
15 10 

TTC TGC CCG CGG CTG CTA CCA CTG CTG GCA TTG TTG GAG CTG CTC GTC 96 
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Phe Cys Pro Arg Leu Leu Pro Leu Leu Ala Leu Leu Glu Leu Leu Val 
15 20 25 30 



GAC CCC AGC TTG GGC CGC GTC CAT CAC CTG GCG CTC AAA GAT GAC GTG 
Asp Pro Ser Leu Gly Arg Val His His Leu Ala Leu Lys Asp Asp Val 
35 40 45 

CGG CAC AAA GTC CAT CTC AAC ACC TTC GGC TTC TTC AAG GAC GGG TAC 
Arg His Lys Val His Leu Asn Thr Phe Gly Phe Phe Lys Asp Gly Tyr 
50 55 " 60 

ATG GTG GTC AAC ATC AGC AGC CTC TCT GTG AAC GAG CCT GAG GGA GTC 
Met Val Val Asn lie Ser Ser Leu Ser Val Asn Glu Pro Glu Gly Val 
65 70 75 

AAG GAC AAA GAC ACT GAG ATT GGC TTC AGT CTT GAT CGG ACC AAG AAT 
Lys Asp Lys Asp Thr Glu lie Gly Phe Ser Leu Asp Arg Thr Lys Asn 
80 85 90 



144 



192 



240 



288 



GAT GGC TTT TCT TCT TAC CTG GAT GAA GAT GTG AAT TAC TGT ATT TTA 
Asp Gly Phe Ser Ser Tyr Leu Asp Glu Asp Val Asn Tyr Cys lie Leu 
95 100 105 110 

AAA AAA AAG TCC ATG TCT TCT GTC ACT CTG CTC ATC TTA GAC ATC TCC 
Lys Lys Lys Ser Met Ser Ser Val Thr Leu Leu lie Leu Asp lie Ser 
115 120 125 

GGA AGT GGA GTC AAG GTC AGA TCC CCA CCA GAA GCT GGC AAG CAG TTA 
Gly Ser Gly Val Lys Val Arg Ser Pro Pro Glu Ala Gly Lys Gin Leu 
130 135 140 

CCC GAG ATT GTC TTC AGC AAG GAT GAG AAA GTC CCG AGT CGG AGC CAG 
Pro Glu lie Val Phe Ser Lys Asp Glu Lys Val Pro Ser Arg Ser Gin 
145 150 155 

GAG CCT GCT GTC AGC TCC AAC CCC AAA GAC AGC AAG GTG CAG AGA ACC 
Glu Pro Ala Val Ser Ser Asn Pro Lys Asp Ser Lys Val Gin Arg Thr 
160 165 170 

CCT GAT GGT TCC AAG GCT CAA CGA AGT ACA GTG GAT TCA AAG ACT ATA 
Pro Asp Gly Ser Lys Ala Gin Arg Ser Thr Val Asp Ser Lys Thr lie 
175 180 185 190 

GCA GAG AAA TTC TTC TCA ATT CAT AAG AAT GAT GGG GCA GTT TCA TTT 
Ala Glu Lys Phe Phe Ser lie His Lys Asn Asp Gly Ala Val Ser Phe 
195 200 205 

CAG TTC TTC TTT AAC ATC AGC ACC AGT GAC CAG GAA GGC CTC TAC AGC 
Gin Phe Phe Phe Asn lie Ser Thr Ser Asp Gin Glu Gly Leu Tyr Ser 
210 215 220 

CTT TAC TTC CAC AAG TGC CCA AGC AGC AAG CTG AGG TCT GGT GAG CAG 
Leu Tyr Phe His Lys Cys Pro Ser Ser Lys Leu Arg Ser Gly Glu Gin 
225 230 235 



336 



384 



432 



480 



528 



576 



624 



672 



720 



GTC TCA TTC AGC CTA AAT ATT GAT ATC ACC GAG AAG AAT CCT GAC AGC 
Val Ser Phe Ser Leu Asn lie Asp lie Thr Glu Lys Asn Pro Asp Ser 
240 245 250 

TAC CTG TCT GCA GGG GAA ATC CCT CTC CCC AAG TTA TAC GTT TCC ATG 
Tyr Leu Ser Ala Gly Glu lie Pro Leu Pro Lys Leu Tyr Val Ser Met 
255 260 265 " 270 



768 



816 
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GCC TTG TTG TTC TTC CTG TCT GGG ACC GTG TGG ATT CAC ATC CTT CGT 864 
Ala Leu Leu Phe Phe Leu Ser Gly Thr Val Tip lie His lie Leu Arg 
275 280 285 

AAA CGA AGG AAT GAT GTA TTT AAA ATT CAC TGG TTG ATG GCG GCC CTT 912 
Lys Arg Arg Asn Asp Val Phe Lys lie His Trp Leu Met Ala Ala Leu 
290 295 300 

CCT TTC ACC AAG TCT CTC TCC TTG GTG TTC CAT GCA ATC GAC TAC CAC 960 
Pro Phe Thr Lys Ser Leu Ser Leu Val Phe His Ala lie Asp Tyr His 
305 310 315 

TAC ATC TCC TCG CAG GGC TTT CCG ATT GAA GGC TGG GCT GTT GTG TAC 1008 
Tyr He Ser Ser Gin Gly Phe Pro He Glu Gly Trp Ala Val Val Tyr 
320 325 330 

TAC ATA ACT CAC CTG CTG AAG GGT GCA CTG CTG TTC ATC ACC ATC GCC 1056 
Tyr He Thr His Leu Leu Lys Gly Ala Leu Leu Phe He Thr He Ala 
335 340 345 350 

CTC ATT GGC ACT GGC TGG GCC TTC ATC AAG CAC ATC CTG TCT GAT AAG 1104 
Leu He Gly Thr Gly Trp Ala Phe He Lys His He Leu Ser Asp Lys 
355 360 365 

GAC AAG AAG ATC TTC ATG ATT GTC ATT CCG CTC CAG GTA CTG GCG AAT 1152 
Asp Lys Lys He Phe Met He Val He Pro Leu Gin Val Leu Ala Asn 
370 375 380 

GTG GCC TAC ATC ATC ATA GAG TCT ACT GAG GAG GGC ACA ACT GAG TAT 1200 
Val Ala Tyr He He He Glu Ser Thr Glu Glu Gly Thr Thr Glu Tyr 
385 390 395 

GGC TTG TGG AAG GAC TCT CTA TTC CTG GTG GAT TTG CTG TGC TGC GGC 1248 
Gly Leu Trp Lys Asp Ser Leu Phe Leu Val Asp Leu Leu Cys Cys Gly 
400 405 410 

GCC ATC CTC TTC CCA GTG GTG TGG TCA ATC AGG CAT TTA CAA GAA GCC 1296 
Ala He Leu Phe Pro Val Val Trp Ser He Arg His Leu Gin Glu Ala 
415 420 425 430 

TCG GCC ACA GAT GGA AAG CTG CCA TTA ACC TAGCAAAGCT GAAGCTTTTC 1346 
Ser Ala Thr Asp Gly Lys Leu Pro Leu Thr 
435 440 



AGACATTACT 


ACGTCCTGAT 


CGTGTGCTAC 


ATCTACTTCA 


CCAGGATCAT 


CGCCTTTCTT 


1406 


CTCAAGTTCG 


CTGTTCCCTT 


CCAGTGGAAG 


TGGCTCTACC 


AGCTGCTGGA 


TGAAACAGCC 


1466 


ACACTGGTGT 


TCTTTGTCCT 


GACGGGGTAT 


AAATTTCGCC 


CAGCTTCAGA 


TAACCCCTAC 


1526 


CTGCAGCTCT 


CTCAGGAAGA 


AGATGACCTG 


GAGATGGAGT 


CTGTAGTGAC 


GACGTCAGGA 


1586 


GTAATGGAGA 


ACATGAAGAA 


GGTGAAGAAA 


GTGAGCAATG 


GTGCTGTGGA 


GCCCCAGGGC 


1646 


AGCTGGGAAG 


GCACCGCGTG 


ACAGCGCAGC 


TGAGGACCGC 


AGGCCGTCAA 


TGAACCTAGC 


1706 


TTATCCATAG 


CCCTATCAGT 


GAGCACGCCT 


GTTCCTGACA 


GACGATGGCA 


TCTCCCAGCA 


1766 


GCGACACCCC 


GGCGCTCAGC 


ACCATGGCCG 


AGCCGAGTGC 


TATGGACACA 


CACTTTTGTA 


1826 


CTCTTAGGGA 


TCTGGATTGA 


GGTGGGCTAC 


AGGCAGTGAG 


GACTGCCCTC 


AGTGAGGGCG 


1886 


GAGGCTTAGA 


GGAGGAGTAA 


AGACTGTCTG 


TCTGATTTAC 


CTTGCTAATT 


GGGAGCCTGG 


1946 
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GTCTGTGCAG GGGAGCCCCC 


AGTGGCTGCT 


ACTCAGAAAG 


GCTGTGGGTG 


TGGCACGGGT 


2006 


CTGCCACTCG CTGGTCACAG 


TTCTGTGGAG 


GCTGGGAGGA TGTGACTCAC 


TGCTTGGCCT 


2066 


AGCATTTACG AGGCTTTGGG 


TTCAATTCCT 


AGTGCCAGGG 


TGGGGGGAAA 


AGAAGTTACT 


2126 


TTGGGACTAG TCAGGTATCT 


ACCAAGGGCA 


CAGAGAAGAG 


AAAGAGATTG 


TCAGCCTTTG 


2186 


GGAACAAGAA CAACAGAAAT 


GATTGAAATT 


CTTTTAAATA 


GCCAATAAAC 


CCCTAGGTGG 


2246 


CAGAAGACAC CTGTCCAGGG 


TCACTGGCCC 


CCAGGCACCA 


CTCTAAATGT 


CCCTGCCCCT 


2306 


GATGTGACCT ACAACAGTGG 


TGCTAGAGGC 


CAGAGGGAGG 


GAGGGAGGGG 


GCCTGTTACA 


2366 


TGGCTGGCTG CAGGGGAGAA 


GAGCCTTGGC 


TCTCCAGAAG 


TGTTCTCCAG 


TTGGCTCTCA 


2426 


GGACTCCATC CCCTGGCTGC 


CTGCCCAGAG 


CAGCCGCCTG 


GCTTCCTCTC 


CTTCCACCTG 


2486 


TGTCTTCTGA TCCGAGAAAA 


CAGTTTTCCT 


GAAGGAGCAG 


CCGAGGACAG 


AAAGGGTAGT 


2546 


CTCTAAACAA ACACTTAGGT 


TAAGCATCTT 


CCACAGCTCG 


CATGGGCCAG 


GCAGGACCTC 


2606 


TGAGCCTCAG CTATAGTCCA 


AGCATGCGTG 


AAGGAAGTGT 


TTGAGCCACT 


CATGCTCAGA 


2666 


GACCTGAGGC AGCCGGGTCT 


GCTCAGAGCT 


TGGACAGGGA 


CACCAGCGGC 


ACCTGCGGAA 


2726 


GGCCTGTGAG GGGCCACAGG 


AGACCAGCTC 


AGCACCTCAG 


AGCTGCTGCC 


TGCCCAGAAC 


2786 


TGGCGGGCGA GTTGCACCTT 


CAAGGAACAG 


GCACTTCCGG 


GAGGGGGGTG 


AGTTACTCCA 


2846 


CCTCCCAGTG CAGAAGACAA 


AGGTCTGGTC 


TCCACACCAC 


ACCCCAAGTC 


CCTGTCCTGT 


2906 


GCCACCAGTG AGAAGCTGCT 


GTCTGCTGCC 


CTAGCAGGTC 


CGTCACTACC 


GCAGGTGCTC 


2966 


CATCTTGAGA AAGGTCGACA 


GGCCTGAAGG 


ATCCACCTAA 


AGGACCTTTT 


TCTCCTGTTT 


3026 


TGTTTGCTTT GTTTATACTT 


GTTTAAAGTA 


TGTGGTGGGA AGGGAGTGTC 


GAGGAGACAA 


3086 


GGTGTTAATG CTTTGTCTTC 


TGCACATAAG 


CACTCCTCCA GCACTGCAGA 


AGTGTGTCCC 


3146 


TGTGGTGACT CTAAGCCACA 


GCTGTCCCTC 


AGAGGGCGCG ATCTTTATGG 


AAGGAGCCTC 


3206 


CTTTCACTTG CTGGGTTTTA 


ATAAGGCCAC 


TCTGACAGAG 


AGCAGAGGGA 


GCTGTGTACA 


3266 


ACCCAGCCAC AGAGCTCTCT 


GGGTGCACAG 


CTGAGCGGCT ATGGGAGACA 


GCCCGCTCCT 


3326 


TCCAGAGCAT GGGTGCACAG 


GCAGGGAGGT 


GTGCTACAGC 


AGCGGCCCTC 


CCAAGTCCTG 


3386 


CCCTGTGACT GCTGTAGGTG 


GGAAGCCAGG 


CATGTTGGGG 


GTTGTGGTGG 


GGAGGGGGGA 


3446 


TCAGAAGCCT AACATGGGGC 


TGGGAAGAAC 




CCCAGGCTGT 


GGAAAGGAAA 


3506 


ACTCCCAGCC CTCAGCATGC 


ACTGTAAGGT 


GTCTGTCTCA TCTGCATCGC 


TTGAGAGCTG 


3566 


GGAGCCCGCA TTCAGCAGTG 


TCCTTGGGTA 


GAGGAGTCTG ATTGGCATCC 


AGCCGCCCCA 


3626 


GTTCATGAGC TGCCCTCTGC 


AGCCTTGAGT 


CTTAGCCTAC 


CCAGCTCCAC 


AACCGTGTTT 


3666 


GAGTGGTGAA GCCCTTGTGT 


TGGAGGGGCT 


GAGCAGAAAT GGGGTGTACA 


TTGCACTGCC 


3746 


ACTGTGGTAT TAATAGCTCT 


GTGTTGTGTC 


CCCACCCCCA CCCCCCATCT 


TCTGCCTGCG 


3806 
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TGAGGTATCC ATACTTGTTT TGAGAGTGCT GTGGACTGGA AGCTGTGGGC TGTGACATTT 3866 
AATAAAGTGT GGATTTTGTT AGATAATTAT TTCTTGGACA ACAGGAACAA GTGTTCAGCT 3926 
CTGAAAAAAA GTCGAC 3942 

(2) INFORMATION FOR SEQ ID NO: 43: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 440 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 43: 

lie Asn Met Ala Val Arg Val Pro Leu Gly Cys Thr Gly Ser Phe Cys 
15 10 15 

Pro Arg Leu Leu Pro Leu Leu Ala Leu Leu Glu Leu Leu Val Asp Pro 
20 25 30 

Ser Leu Gly Arg Val His His Leu Ala Leu Lys Asp Asp Val Arg His 
35 40 45 

Lys Val His Leu Asn Thr Phe Gly Phe Phe Lys Asp Gly Tyr Met Val 
50 55 .60 

Val Asn lie Ser Ser Leu Ser Val Asn Glu Pro Glu Gly Val Lys Asp 
65 70 75 80 

Lys Asp Thr Glu lie Gly Phe Ser Leu Asp Arg Thr Lys Asn Asp Gly 
85 90 95 

Phe Ser Ser Tyr Leu Asp Glu Asp Val Asn Tyr Cys lie Leu Lys Lys 
100 105 110 

Lys Ser Met Ser Ser Val Thr Leu Leu lie Leu Asp lie Ser Gly Ser 
115 120 125 

Gly Val Lys Val Arg Ser Pro Pro Glu Ala Gly Lys Gin Leu Pro Glu 
130 135 140 

lie Val Phe Ser Lys Asp Glu Lys Val Pro Ser Arg Ser Gin Glu Pro 
145 150 155 160 

Ala Val Ser Ser Asn Pro Lys Asp Ser Lys Val Gin Arg Thr Pro Asp 
165 170 175 

Gly Ser Lys Ala Gin Arg Ser Thr Val Asp Ser Lys Thr lie Ala Glu 
180 185 190 

Lys Phe Phe Ser lie His Lys Asn Asp Gly Ala Val Ser Phe Gin Phe 
195 200 205 

Phe Phe Asn lie Ser Thr Ser Asp Gin Glu Gly Leu Tyr Ser Leu Tyr 
210 215 220 

Phe His Lys Cys Pro Ser Ser Lys Leu Arg Ser Gly Glu Gin Val Ser 
225 230 235 240 
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Phe Ser Leu Asn He Asp He Thr Glu Lys Asn Pro Asp Ser Tyr Leu 
245 250 255 

Ser Ala Gly Glu He Pro Leu Pro Lys Leu Tyr Val Ser Met Ala Leu 
260 265 270 

Leu Phe Phe Leu Ser Gly Thr Val Trp He His He Leu Arg Lys Arg 
275 280 285 

Arg Asn Asp Val Phe Lys He His Trp Leu Met Ala Ala Leu Pro Phe 
290 295 300 

Thr Lys Ser Leu Ser Leu Val Phe His Ala He Asp Tyr His Tyr He 
305 310 315 320 

Ser Ser Gin Gly Phe Pro He Glu Gly Trp Ala Val Val Tyr Tyr He 
325 330 335 

Thr His Leu Leu Lys Gly Ala Leu Leu Phe He Thr He Ala Leu He 
340 345 350 

Gly Thr Gly Trp Ala Phe He Lys His He Leu Ser Asp Lys Asp Lys 
355 360 365 

Lys He Phe Met He Val He Pro Leu Gin Val Leu Ala Asn Val Ala 
370 375 380 

Tyr He He He Glu Ser Thr Glu Glu Gly Thr Thr Glu Tyr Gly Leu 
385 390 395 400 

Trp Lys Asp Ser Leu Phe Leu Val Asp Leu Leu Cys Cys Gly Ala He 
405 410 415 

Leu Phe Pro Val Val Trp Ser He Arg His Leu Gin Glu Ala Ser Ala 
420 425 430 

Thr Asp Gly Lys Leu Pro Leu Thr 
435 440 

(2) INFORMATION FOR SEQ ID NO: 44: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1838 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 

(vii) IMMEDIATE SOURCE: 
(B) CLONE: HW051 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 114.. 1505 

(ix) FEATURE: 

(A) NAME/KEY: misc_f eature 

(B) LOCATION: 1411.. 1603 

(D) OTHER INFORMATION: /label* SAC_24170 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 44: 

GTCGACGGCT GGTTTGAAAA GTGACAACGG CCGGTGGATT TTAGGAGTTT GCTCGGTTTG 60 

TAACTGCTCT TTGGTGAGCT ACTGGGACTG CAGACTAGGA GGAGACTCCC AAA ATG 116 

Met 
1 

GAA ACT CTG TCC TTC CCC AGA TAC AAC ATA GCT GAG ATT GTA GTT CAT 164 
Glu Thr Leu Ser Phe Pro Arg Tyr Asn lie Ala Glu lie Val Val His 
5 10 15 

ATT CGC AAT AAA CTG TTA ACT GGA GCG GAT GGC AAA AAC CTC TCC AAG 212 
lie Arg Asn Lys Leu Leu Thr Gly Ala Asp Gly Lys Asn Leu Ser Lys 
20 25 30 

AGC GAT TTT CTT CCA AAC CCG AAG CCT GAA GTC CTG TAC ATG ATT TAC 260 
Ser Asp Phe Leu Pro Asn Pro Lys Pro Glu Val Leu Tyr Met lie Tyr 
35 40 45 

ATG AGA GCC TTA CAG TTA GTG TAT GGG GTC CGG CTG GAG CAT TTC TAC 308 
Met Arg Ala Leu Gin Leu Val Tyr Gly Val Arg Leu Glu His Phe Tyr 
50 55 60 65 

ATG ATG CCG GTG AAC ATA GAA GTC ATG TAT CCA CAT ATA ATG GAG GGC 356 
Met Met Pro Val Asn lie Glu Val Met Tyr Pro His lie Met Glu Gly 
70 75 80 

TTC TTA CCG GTC AGC AAT TTG TTC TTC CAC CTG GAC TCG TTT ATG CCC 404 
Phe Leu Pro Val Ser Asn Leu Phe Phe His Leu Asp Ser Phe Met Pro 
85 90 95 

ATT TGC CGG GTG AAT GAC TTT GAG ATC GCC GAT ATT CTT TAT CCA AAA 452 
He Cys Arg Val Asn Asp Phe Glu He Ala Asp He Leu Tyr Pro Lys 
100 105 110 

GCA AAC CGG ACA AGT CGT TTT TTA AGT GGC ATT ATC AAC TTT ATT CAC 500 
Ala Asn Arg Thr Ser Arg Phe Leu Ser Gly He He Asn Phe He His 
115 120 125 

TTC AGA GAA ACA TGC CTG GAG AAG TAT GAA GAA TTT CTT TTG CAA AAT 548 
Phe Arg Glu Thr Cys Leu Glu Lys Tyr Glu Glu Phe Leu Leu Gin Asn 
130 135 140 145 

AAA TCC TCT GTG GAC AAA ATA CAG CAG TTA AGC AAT GCA CAC CAG GAA 596 
Lys Ser Ser Val Asp Lys He Gin Gin Leu Ser Asn Ala His Gin Glu 
150 155 160 

GCA TTG ATG AAA CTG GAA AAA CTC AAT TCG GTT CCC GTG GAG GAG CAG 644 
Ala Leu Met Lys Leu Glu Lys Leu Asn Ser Val Pro Val Glu Glu Gin 
165 170 175 

GAA GAG TTC AAA CAG CTG AAG GAT GAC ATC CAG GAG CTG CAG CAC TTG 692 
Glu Glu Phe Lys Gin Leu Lys Asp Asp He Gin Glu Leu Gin His Leu 
180 185 190 

CTG AAT CAA GAC TTC AGA CAG AAA ACG ACA CTG CTG CAG GAG AGA TAT 740 
Leu Asn Gin Asp Phe Arg Gin Lys Thr Thr Leu Leu Gin Glu Arg Tyr 
195 200 205 

ACC AAA ATG AAA TCA GAT TTT TCA GAG AAA ACC AAG CAT GTT AAT GAG 788 
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Thr Lys Met Lys Ser Asp Phe Ser Glu Lys Thr Lys His Val Asn Glu 
210 215 220 225 

CTA AAG TTG TCA GTA GTT TCT TTG AAA GAA GTT CAA GAC AGT TTG AAA 836 
Leu Lys Leu Ser Val Val Ser Leu Lys Glu Val Gin Asp Ser Leu Lys 
230 235 240 

AGC AAA ATT GTG GAT TCT CCA GAG AAG CTG AAG AAC TAT AAA GAG AAG 884 
Ser Lys He Val Asp Ser Pro Glu Lys Leu Lys Asn Tyr Lys Glu Lys 
245 250 255 

ATG AAG GAC ACC GTC CAG AAG CTC CGC AGT GCC AGG GAA GAA GTG ATG 932 
Met Lys Asp Thr Val Gin Lys Leu Arg Ser Ala Arg Glu Glu Val Met 
260 265 270 

GAG AAG TAT GAT ATC TAT AGA GAT TCT GTG GAT TGC TTG CCT TCC TGT 980 
Glu Lys Tyr Asp He Tyr Arg Asp Ser Val Asp Cys Leu Pro Ser Cvs 
275 280 285 

CAG CTG GAG GTG CAG TTA TAT CAA AAG AAA TCA CAG GAC CTT GCA GAT 1028 
Gin Leu Glu Val Gin Leu Tyr Gin Lys Lys Ser Gin Asp Leu Ala Asp 
290 295 300 305 

AAT AGG GAG AAA CTA AGC AGT ATC TTA AAG GAG AGC CTG AAC CTG GAG 1076 
Asn Arg Glu Lys Leu Ser Ser He Leu Lys Glu Ser Leu Asn Leu Glu 
310 315 320 

GGC CAG ATT GAT AGT GAT TCA TCA GAA CTA AAG AAA CTG AAG ACT GAA 1124 
Gly Gin He Asp Ser Asp Ser Ser Glu Leu Lys Lys Leu Lys Thr Glu 
325 330 335 

GAG AAC TCC CTC ATA AGA CTG ATG ACT CTA AAG AAG GAG AGA CTT GCT 1172 
Glu Asn Ser Leu He Arg Leu Met Thr Leu Lys Lys Glu Arg Leu Ala 
340 345 350 

ACC ATG CAG TTC AAA ATA AAC AAG AAG CAG GAG GAT GTG AAA CAG TAC 1220 
Thr Met Gin Phe Lys He Asn Lys Lys Gin Glu Asp Val Lys Gin Tyr 
355 360 365 

AAG CGG ACC ATG ATT GAA GAT TGC AAT AAA GTT CAA GAA AAA AGA GAT 1268 
Lys Arg Thr Met He Glu Asp Cys Asn Lys Val Gin Glu Lys Arg Asp 
370 375 380 - 385 

GCT GTC TGC GAG CAA GTA ACC GCC ATT AAT CAA GAC ATC CAC AAG ATT 1316 
Ala Val Cys Glu Gin Val Thr Ala He Asn Gin Asp He His Lys He 
390 395 " 400 

AAA TCT GGG ATT CAG CAG CTA AGA GAC GCC GAA AAA CGG GAG AAA CTG 1364 
Lys Ser Gly He Gin Gin Leu Arg Asp Ala Glu Lys Arg Glu Lys Leu 
405 410 415 

AAG TCT CAG GAA ATC TTG GTA GAC TTG AAA AGT GCT TTG GAG AAG TAC 1412 
Lys Ser Gin Glu He Leu Val Asp Leu Lys Ser Ala Leu Glu Lys Tvr 
420 425 430 

CAT GAG GGC ATC GAG AAG ACG ACG GAG GAG TGC TGC ACT AGA ATA GGA 1460 
His Glu Gly He Glu Lys Thr Thr Glu Glu Cys Cys Thr Arg He Gly 
435 440 445 

GGG AAG ACT GCC GAG CTG AAG AGG AGG ATG TTC AAA ATG CCG CCC 1505 
Gly Lys Thr Ala Glu Leu Lys Arg Arg Met Phe Lys Met Pro Pro 
450 455 460 
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TGATCAACAG CCACCCGAAA ATGGCCTTTC GCTTTCTGTT TGGAGTAGTT ATATTGAAGC 1565 

TAATAGAAGG ACCGCAGTCT CAGCTAACTA GCGTGGGTAC CATTGGTTCT CTGTCCTTTT 1625 

ATGACCATGT GCTCCCTGTG TTTTTTTCTT GGTGATGGGG ATACAACTCA GGGCCTTGCA 1685 

GGCTACACTG ACTTGCCTCC CTAGGCTCTA ATGTACCATG TACTATGTAG GCTTTTGCTA 1745 

CAATTAAAGT AACGTGTACA GCTTTTATGT CCCTACTCTG TCTCCTTTTG TATGTGCTGG 1805 

TTGGAATAAA CAAATAGTTA CTGACGTCAA AAA 1838 

(2) INFORMATION FOR SEQ ID NO: 45: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 464 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 45: 

Met Glu Thr Leu Ser Phe Pro Arg Tyr Asn He Ala Glu He Val Val 
15 10 15 

His He Arg Asn Lys Leu Leu Thr Gly Ala Asp Gly Lys Asn Leu Ser 
20 25 30 

Lys Ser Asp Phe Leu Pro Asn Pro Lys Pro Glu Val Leu Tyr Met He 
35 40 45 

Tyr Met Arg Ala Leu Gin Leu Val Tyr Gly Val Arg Leu Glu His Phe 
50 55 60 

Tyr Met Met Pro Val Asn He Glu Val Met Tyr Pro His He Met Glu 
65 70 75 80 

Gly Phe Leu Pro Val Ser Asn Leu Phe Phe His Leu Asp Ser Phe Met 
85 90 95 

Pro He Cys Arg Val Asn Asp Phe Glu He Ala Asp He Leu Tyr Pro 
100 105 110 

Lys Ala Asn Arg Thr Ser Arg Phe Leu Ser Gly He He Asn Phe He 
115 120 125 

His Phe Arg Glu Thr Cys Leu Glu Lys Tyr Glu Glu Phe Leu Leu Gin 
130 135 140 

Asn Lys Ser Ser Val Asp Lys He Gin Gin Leu Ser Asn Ala His Gin 
145 150 155 160 

Glu Ala Leu Met Lys Leu Glu Lys Leu Asn Ser Val Pro Val Glu Glu 
165 170 175 

Gin Glu Glu Phe Lys Gin Leu Lys Asp Asp He Gin Glu Leu Gin His 
180 185 190 

Leu Leu Asn Gin Asp Phe Arg Gin Lys Thr Thr Leu Leu Gin Glu Arg 
195 200 205 
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Tyr Thr Lys Met Lys Ser Asp Phe Ser Glu Lys Thr Lys His Val Asn 
210 215 220 

Glu Leu Lys Leu Ser Val Val Ser Leu Lys Glu Val Gin Asp Ser Leu 
225 230 235 240 

Lys Ser Lys lie Val Asp Ser Pro Glu Lys Leu Lys Asn Tyr Lys Glu 
245 250 255 

Lys Met Lys Asp Thr Val Gin Lys Leu Arg Ser Ala Arg Glu Glu Val 
260 265 270 

Met Glu Lys Tyr Asp lie Tyr Arg Asp Ser Val Asp Cys Leu Pro Ser 
275 280 285 

Cys Gin Leu Glu Val Gin Leu Tyr Gin Lys Lys Ser Gin Asp Leu Ala 
290 295 300 

Asp Asn Arg Glu Lys Leu Ser Ser lie Leu Lys Glu Ser Leu Asn Leu 
305 310 315 320 

Glu Gly Gin lie Asp Ser Asp Ser Ser Glu Leu Lys Lys Leu Lys Thr 
325 330 335 

Glu Glu Asn Ser Leu lie Arg Leu Met Thr Leu Lys Lys Glu Arg Leu 
340 345 350 

Ala Thr Met Gin Phe Lys lie Asn Lys Lys Gin Glu Asp Val Lys Gin 
355 360 365 

Tyr Lys Arg Thr Met lie Glu Asp Cys Asn Lys Val Gin Glu Lys Arg 
370 375 380 

Asp Ala Val Cys Glu Gin Val Thr Ala He Asn Gin Asp He His Lys 
385 390 395 400 

He Lys Ser Gly He Gin Gin Leu Arg Asp Ala Glu Lys Arg Glu Lys 
405 410 415 

Leu Lys Ser Gin Glu He Leu Val Asp Leu Lys Ser Ala Leu Glu Lys 
420 425 430 

Tyr His Glu Gly He Glu Lys Thr Thr Glu Glu Cys Cys Thr Arg He 
435 440 445 

Gly Gly Lys Thr Ala Glu Leu Lys Arg Arg Met Phe Lys Met Pro Pro 
450 455 460 

(2) INFORMATION FOR SEQ ID NO: 46: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1483 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 

(vii) IMMEDIATE SOURCE: 
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(B) CLONE: HW052 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

<B) LOCATION: 247.. 765 

(ix) FEATURE: 

(A) NAME/ KEY: misc_feature 

(B) LOCATION: complement (660.. 1062) 

(D) OTHER INFORMATION: /label= SAC_24882 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 46: 

GTCGACCAAA AACAGGACAG ATTGCAACAG CCTCTACACT TCCTTCATGG CCTGCTCTTG 60 

GGCCGTGGCG ACAAGCAGAA AGAAGCTTGA TTGTGGAGAG GTGCCCTAGC GGAGGGATGT 120 

CTGCCACAGT CCAGGCACCC CACAGAGGGT AGGAGCTCCA TGTACGAGTG GCTTTGCATT 180 

TCTTCAGAAA AGAATAGCTG ATGTGCAATC TCAGCCTTTC TGCAGAAGGA TTTATCAGGG 240 

GCCAGC ATG GGC TCT GTG AAT TCT AGA GGC CAT AAG GCC CAG GTG GTA 288 
Met Gly Ser Val Asn Ser Arg Gly His Lys Ala Gin Val Val 
15 10 

ATG CTG GGC CTC GAC TGT GCT GGC AAG ACC ACA ATC CTG TAC AAA CTG 336 
Met Leu Gly Leu Asp Cys Ala Gly Lys Thr Thr lie Leu Tyr Lys Leu 
15 20 25 30 

AAA GGA AAC CGG CTG GTG GAT ACC CTA CCC ACT GTT GGT TTT AAT GTA 384 
Lys Gly Asn Arg Leu Val Asp Thr Leu Pro Thr Val Gly Phe Asn Val 
35 40 45 

GAG CCT CTT GAG GCT CCT GGA CAT GTG TCG CTG ACT CTC TGG GAC ATT 432 
Glu Pro Leu Glu Ala Pro Gly His Val Ser Leu Thr Leu Trp Asp lie 
50 55 60 

GGG GGA CAG ACC CAG CTC AGG GCT ACG TGG AAG GAC TAC CTG GAA GGC 480 
Gly Gly Gin Thr Gin Leu Arg Ala Thr Trp Lys Asp Tyr Leu Glu Gly 
65 70 75 

ATT GAC CTC CTT GTG TAC GTG CTG GAC AGC ACA GAT GAA GCC CGC TTG 528 
lie Asp Leu Leu Val Tyr Val Leu Asp Ser Thr Asp Glu Ala Arg Leu 
80 85 90 

CCC GAG GCA GTG GCT GAG CTC GAG GAA GTC CTA GAA GAC CCC AAC ATG 576 
Pro Glu Ala Val Ala Glu Leu Glu Glu Val Leu Glu Asp Pro Asn Met 
95 100 105 110 

GCT GGT GTT CCT TTC TTG GTA CTG GCC AAC AAG CAG GAG GCT CCT GAT 624 
Ala Gly Val Pro Phe Leu Val Leu Ala Asn Lys Gin Glu Ala Pro Asp 
115 120 125 

GCT CTA CCA TTG CTT GAA ATC AGA AAC AGG CTG GAC CTG GAA AGG TTC 672 
Ala Leu Pro Leu Leu Glu lie Arg Asn Arg Leu Asp Leu Glu Arg Phe 
130 135 140 

CAA GAC CAT TGC TGG GAG CTG CGG GCT TGC AGC GCT CTC ACA GGT CAG 720 
Gin Asp His Cys Trp Glu Leu Arg Ala Cys Ser Ala Leu Thr Gly Gin 
145 150 155 

GGG CTA CAG GAA GCC CGG CAG AGC CTG CTG CAT TTG CTG AGA TCC 765 
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Gly Leu Gin Glu Ala Arg Gin Ser Leu Leu His Leu Leu Arg Ser 



160 




165 




170 






TGATGAAATG 


CAGTAACCTC 


TAACTCCAGG 


CTGGAGCAGG 


ACAGGGAGCA 


GGGTCAGCCA 


825 


GACCCCCTGA 


GCAGGCAAAT 


AGATTATCCT 


TGCTTCTAAG 


AACAGGAGGG 


GCCAGCTGAT 


885 


CCTTGAGAAA 


GTGAAGCTCC 


GTTTGCCACA 


TGAGAAACCA 


CGTTTCTCTT 


GGACGGTGTG 


945 


TTTTCTTCTC 


AGGACTTCTT 


CTCAGTGGCG 


GTTATGCGGA 


AAATCAACTG 


TTCTTCAAGT 


1005 


AACAATCGTC 


AGAGGGAATA 


TAAAAACCCT 


CAGAATACCC 


CCATGTAACA 


GCTCAGTACC 


1065 


AAGTGTCAAA 


GTCACAAACT 


TGGTGACATA 


ATTATTGATG 


ACCAAGTTCA 


CAAATGAAAA 


1125 


GAGGTTTCAA 


AACTACATCT 


GAGTGTCACT 


GGGCCTATCG 


CTGTTATCCG 


AGTGTCACTG 


1185 


GGCCTATCAT 


TGTCATCCGA 


GTGTCACCAG 


GCCTATCACG 


GTCATCTGCC 


TGTCACTGGG 


1245 


CCTATCACAG 


TCATCAGGAG 


CGACTCTTCC 


TACTGGCCTT 


GGGGCAGAAG 


CCACTGTCAC 


1305 


TGGTGGGCTC 


ACAGCAGGAG 


GGGCAGGTCT 


GCTCTCATAA 


TGCTGCAGAA 


GGCAGAAGTC 


1365 


CATGGGGCCC 


TAGATCCTCC 


CAGAATTCCC 


CCTGATCTCC 


AACTCTGACT 


AACTCTTTTC 


1425 


TGCCCTATAT 


CCTCTGTTGT 


AAAAAAGTAA 


AATAAAACTC 


CTCTTATACT 


CTCAAAAA 


1483 



(2) INFORMATION FOR SEQ ID NO: 47: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 173 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 47: 

Met Gly Ser Val Asn Ser Arg Gly His Lys Ala Gin Val Val Met 
15 10 15 

Gly Leu Asp Cys Ala Gly Lys Thr Thr lie Leu Tyr Lys Leu Lys 
20 25 30 

Asn Arg Leu Val Asp Thr Leu Pro Thr Val Gly Phe Asn Val Glu 
35 40 45 

Leu Glu Ala Pro Gly His Val Ser Leu Thr Leu Trp Asp lie Gly 
50 55 60 

Gin Thr Gin Leu Arg Ala Thr Trp Lys Asp Tyr Leu Glu Gly lie 
65 70 75 

Leu Leu Val Tyr Val Leu Asp Ser Thr Asp Glu Ala Arg Leu Pro 
85 90 95 

Ala Val Ala Glu Leu Glu Glu Val Leu Glu Asp Pro Asn Met Ala 
100 105 110 

Val Pro Phe Leu Val Leu Ala Asn Lys Gin Glu Ala Pro Asp Ala 
115 120 125 



Leu 

Gly 

Pro 

Gly 

Asp 
80 

Glu 
Gly 
Leu 
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Pro Leu Leu Glu lie Arg Asn Arg Leu Asp Leu Glu Arg Phe Gin Asd 
130 135 140 

His Cys Trp Glu Leu Arg Ala Cys Ser Ala Leu Thr Gly Gin Gly Leu 
14 5 150 155 160 

Gin Glu Ala Arg Gin Ser Leu Leu His Leu Leu Arg Ser 
165 170 

(2) INFORMATION FOR SEQ ID NO: 48: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2288 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 

(vii) IMMEDIATE SOURCE: 
(B) CLONE: HW054 

(ix) FEATURE: 

(A) NAME/KEY: misc_f eature 

(B) LOCATION: 1952.. 2152 

(D) OTHER INFORMATION: /label= SAC_24183 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 48: 

GTCGACGGTG GGGTCTAACT GAGGTCTTGT ATTTGGTTTT TCCTGAAGAT CGACCTCTGG 60 

CATCAGGAGG CTTGAGCAAG GAGAAGAGTG GATGATGAAG GAAAGAGACA AGGAAAGAAA 120 

GAGAAGTGTG CACAAAGAAA CTTGTATTAT TATTAATTAG CACCTAGCTT GTTTGTGTCT 180 

GATACACCAC CAAGTAGTAA TTGTTGAAAA AAGCGAAGAA GAAAAAAAAA AAAACAAAAA 240 

AACCAAACAG TGGGTACTCA AATAAGATAG GAGAAAAATG AAGAGAACAG ACCCAGTTCT 300 

CGACCCTTGC TTCTCGAAGG TCCTCCCACC AGGCTGCCAA AGCAAGATGG TGTTGCTCTG 360 

ATCCAGTCAG TATTCTTTTG ACTTTTTTTT TTAATCTCCA GGTTTTGGTT CAGGCTCCCA 420 

TATTCATACC CTGGCTCATT TAGCTTTCCC TCATGTTGTG GGTTCTTCTG TCCCTCACCC 480 

CCTTACTCTC CCCACTGATA TTCTTCCCAC GTCAAGACTG TGGCTCTGGA AGAAATATCC 540 

ACCATTTGCA GAGCTGATGT TCTGTAGATC GTAATGTTGA AGCGCTGGGT GTCCTGGTTG 600 

GCAGAATCAC TCCTGTATTA CTCTGGTACA TAGGTGTCTC CTGNTAGACT CCCTGGCCTT 660 

AGTCATGGGG TGTTTTCTAG AGGCAGACTA AGACAGGAGT CAAAAAAGAT TTAGAGGAAG 720 

GAGCTGAGGA AAGAAAGACA GTTGTGGGAG GAAAATCAAG TTCTACTCAG GATCCCGAGT 780 

GTTTCTGTAG ATGTAGATTG GAATGTGTCC ATAACAGAGA GGCCAGTGAG AGACATCCCC 840 
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AAGGAPPTGP 


P A GGPTTTYT* 


TTPPPTPP A r» 




CATCACTCAA AAGG ggtttc 


900 


CTAGAAAGAA 


AG APAAGTG A 


PTTAA A A A AT 


L» xviv.U Avj XvjVj 


OTTCTTGAAG TCATCGAACC 


960 


TAGGAGGTAG 


A A AGP TP & PP. 


PPPPTP A TO T 

VjVJV3\JT X A X\3 1 


x A X \ JoAAWiv7 


ACjGTGaGGTT cagtcaggct 


1020 


AACTATTAGG 


GAGANTGTPA 

VJAVJAll X w X wn 


AAGGAGGCTA 


CAGAAATGGG 


ATTCTGGCTG TGGGGCATTA 


i AAA 

1080 


AAGGCTATCA 


AGTPAAPAGG 


TAAGAGTGGA 


GTGGCTCAGC 


AGCTTCTCTG GCGCTGAAAG 


1140 


TGGTGTPTPT 


T AAP AGTT A A 


TTCCCTTCCC 


TTTTCCATCA 


CAGAAGGAAT CATTCCAGCC 


1200 


CTPPPTGGGA 


TTATGGTPPP 


CCAGTGACCA 


TGTTCCATCT 


GCTGGATTTT TGATCAGTTT 


1260 


AGAAGGAAAA 


A A AP A CTTCiCZ 
nnnwrVVv X X OVjr 


GTTCGGTGGT ATTTGCTTAT GACCTTAGCA CTTTGGAGCT 


1320 


GAGAAGAGGA 


TTCPPATAf^T 
X X X AvJ X 


TTCCAGACCA 


GTCTGGTTTA 


CATAGTGAGT TCTAGGGCTG 


1380 


PGTAATGAGA 


TYITPPT A APA 
x \j Xl» X C/vrloA 


ATGGAGAGGG GCGTTAGGGA GAACTTACTT AGCTCTTCAA 


1440 


G & & & PTT2TPP 


TPTA A iipia a 
X vj X AAAl« AAA 


GAGAAATGAG TCGATGGGTG ACCAGTGGGA GCCTNCCCCC 


1500 


v-AV-L, ILL X\»A 




CTTTTCCATT AAGGGGCCTG TGCATCTCAG TAAGTGGTCC 


1560 


X VmmTVO X x X J.\»l» 


TP. A PP A TTVT 


GATGGTGAGA GTAGAGAGAG CAAAAAGATT CTCAGCCCTG 


1620 


fT/lATOTAf" A A 


v.L X x vivj rTCL. 


ACTCTTTTCC 


CGTTCTGGTC 


TGCCTGAGCT CCCCCCCCCC 


1680 


f2 nnt^nnr* a fit* 


GGTCC TGTTT 


GCTGTCCAGA 


CATTGCACCT 


GCCAGTCTTC GGTGCAGCCA 


1740 


gttgta gp an 


IV TP A TT & A A P 
A ion xLAAAVj 


CAACCAGNGG 


TCCAAGAACA 


GGGACAATCA ACAAACAGTG 


1800 


TTATPTTPPA 


VjV*./ilj\jOL.Al- X 


CTCAGGAAGT 


GAGGATCTGA 


ATGGTAAAGC AGAACAGAGT 


1860 


V7X X ILULIVI 


ATPPPAPAPP 
A X \ JV* V* AL. no L» 


CGCTCATGCC 


ACACCCACTT 


GAAGCCTCCA AAGCCAGNAC 


1920 


AAGPTGTAGA 


GTPP ATPA AT 


GTAGATGAGG 


TACATTGTGC 


GCCTTGCTTT TACTTCCTTT 


1980 


AGPTTPTGAT 

«VJ\» X X Vw X un X 


TGAATTTAPP 


TGTATCTTTT 


TAACCGATTT 


ATCGTAGAAN ACAATGGGTG 


2040 




MTV' A TPPTTT 
AM 1 LA xoV~ III 


GTTGATCTTC 


AGCCACACGC 


TGCCCTCCCT CGCCCCCACT 


2100 


TCTAGCATTT 


GTGTGTGTGC 


GCGTGTTTTT TTGAAGAGGG GTTNTTGCTT GTACAATCCC 


2160 


TGGCTGGCCT 


TGAACTTGAG 


AAGTCCTCCT 


GCCTCCGCCT 


TCCAGGTGCT GGGATTACAA 


2220 


GTTGTCAACC 


CTGCTTTCAC 


TAATGTTTCC 


TTTTTATTAA 


AGATTTATTT ATTATTTAAA 


2280 


AAGTCGAC 










2288 


(2) INFORMATION FOR SEQ ID NO: 49: 









(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2349 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
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(vii) IMMEDIATE SOURCE: 
(B) CLONE: HW055 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 214.. 1329 

(ix) FEATURE: 

(A) NAME/ KEY: misc_f eature 

(B) LOCATION: 1953.. 2065 

(D) OTHER INFORMATION: /label= SAC_24449 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 49: 

GTCGACAATG CTTTCAAGCA AACAGCCGCT ACCAGAACAA CTCTGTGACT AGGAGAAAGA 60 

GGCCAAGGCG CGAGTTCCCA AACCCTGAAG CCACAGGAGC CCAAGGGGGA GACTGGAGAG 120 

AAGGTATTTC TAGCCGTGCA CAGTGACTGA TCAGCCGAGG ACCTACCCTC GAGTTCTATT 180 

CTGCAACAGT GAACAGTGAA GGTCTCCAAA GAG ATG GAG TAC GAG GGT TAC AAC 234 

Met Glu Tyr Glu Gly Tyr Asn 
1 5 

GAC TCC AGC ATC TAC GGT GAG GAG TAT TCT GAC GGC TCG GAC TAC ATC 282 
Asp Ser Ser He Tyr Gly Glu Glu Tyr Ser Asp Gly Ser Asp Tyr He 
10 15 20 

GTG GAC TTG GAG GAG GCG GGT CCA CTG GAG GCC AAG GTG GCC GAG GTC 330 
Val Asp Leu Glu Glu Ala Gly Pro Leu Glu Ala Lys Val Ala Glu Val 
25 30 35 

TTC CTG GTG GTA ATC TAC AGC TTG GTG TGC TTC CTC GGG ATC CTA GGC 378 
Phe Leu Val Val He Tyr Ser Leu Val Cys Phe Leu Gly He Leu Gly 
40 45 50 55 

AAT GGC CTG GTG ATT GTC ATC GCC ACC TTC AAG ATG AAG AAG ACG GTG 426 
Asn Gly Leu Val He Val He Ala Thr Phe Lys Met Lys Lys Thr Val 
60 65 ~ " 70 

AAC ACC GTG TGG TTT GTC AAC CTG GCC GTG GCT GAC TTC CTG TTC AAC 474 
Asn Thr Val Trp Phe Val Asn Leu Ala Val Ala Asp Phe Leu Phe Asn 
75 80 85 

ATC TTC TTG CCC ATC CAC ATC ACC TAT GCC GCT ATG GAC TAC CAC TGG 522 
He Phe Leu Pro He His He Thr Tyr Ala Ala Met Asp Tyr His Trp 
90 95 100 

GTG TTC GGG AAA GCC ATG TGC AAG ATT AGT AGC TTT CTG CTA AGC CAC 570 
Val Phe Gly Lys Ala Met Cys Lys He Ser Ser Phe Leu Leu Ser His 
105 110 115 

AAC ATG TAC ACC AGC GTC TTC CTG CTC ACT GTC ATC AGC TTC GAC CGC 618 
Asn Met Tyr Thr Ser Val Phe Leu Leu Thr Val He Ser Phe Asp Arg 
120 125 130 135 

TGC ATC TCC GTG CTC CTC CCC GTC TGG TCC CAG AAC CAC CGC AGC GTG 666 
Cys He Ser Val Leu Leu Pro Val Trp Ser Gin Asn His Arg Ser Val 
140 145 150 

CGT CTG GCC TAC ATG ACC TGC GTG GTT GTC TGG GTC CTG GCT TTC TTC 714 
Arg Leu Ala Tyr Met Thr Cys Val Val Val Trp Val Leu Ala Phe Phe 
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155 160 165 

CTG AGT TCC CCG TCC CTC GTC TTC CGG GAC ACG GTC AGC ACC AGC CAC 762 
Leu Ser Ser Pro Ser Leu Val Phe Arg Asp Thr Val Ser Thr Ser His 
170 175 180 

GGG AAG ATA ACC TGC TTC AAC AAC TTC AGC CTG GCC GCG CCC GAG CCT 810 
Gly Lys lie Thr Cys Phe Asn Asn Phe Ser Leu Ala Ala Pro Glu Pro 
185 190 195 

TTC TCT CAT TCC ACC CAC CCG CGA ACA GAC CCG GTA GGG TAC AGC AGA 858 
Phe Ser His Ser Thr His Pro Arg Thr Asp Pro Val Gly Tyr Ser Arg 
200 205 210 215 

CAT GTG GCG GTC ACC GTC ACC CGC TTC CTC TGT GGC TTC CTG ATC CCC 906 
His Val Ala Val Thr Val Thr Arg Phe Leu Cys Gly Phe Leu lie Pro 
220 225 230 

GTC TTC ATC ATC ACG GCC TGT TAC CTC ACC ATC GTC TTC AAG TTG CAG 954 
Val Phe He He Thr Ala Cys Tyr Leu Thr He Val Phe Lys Leu Gin 
235 240 245 

CGC AAC CGC CTG GCC AAG ACC AAG AAG CCC TTC AAG ATC ATC ATC ACC 1002 
Arg Asn Arg Leu Ala Lys Thr Lys Lys Pro Phe Lys He He He Thr 
250 255 260 

ATC ATC ATC ACC TTC TTC CTC TGC TGG TGC CCC TAC CAC ACA CTC TAC 1050 
He He He Thr Phe Phe Leu Cys Trp Cys Pro Tyr His Thr Leu Tyr 
265 270 275 

CTG CTG GAG CTC CAC CAC ACG GCT GTG CCA GCC TCT GTC TTC AGC CTG 1098 
Leu Leu Glu Leu His His Thr Ala Val Pro Ala Ser Val Phe Ser Leu 
280 285 290 295 

GGA CTG CCC CTG GCC ACA GCC GTT GCC ATC GCC AAC AGC TGT ATG AAC 1146 
Gly Leu Pro Leu Ala Thr Ala Val Ala He Ala Asn Ser Cys Met Asn 
300 305 310 

CCC ATC CTG TAC GTC TTC ATG GGC CAC GAC TTC AAA AAA TTC AAG GTG 1194 
Pro He Leu Tyr Val Phe Met Gly His Asp Phe Lys Lys Phe Lys Val 
315 320 325 

GCC CTT TTC TCC CGC CTG GTG AAT GCC CTG AGC GAG GAC ACA GGA CCC 1242 
Ala Leu Phe Ser Arg Leu Val Asn Ala Leu Ser Glu Asp Thr Gly Pro 
330 335 340 

TCC TCC TAC CCC AGT CAC AGG AGC TTC ACC AAG ATG TCC TCA TTG ATT 1290 
Ser Ser Tyr Pro Ser His Arg Ser Phe Thr Lys Met Ser Ser Leu He 
345 350 355 

GAG AAG GCT TCA GTG AAT GAG AAA GAG ACC AGC ACC CTC TGAGCCTTCC 1339 
Glu Lys Ala Ser Val Asn Glu Lys Glu Thr Ser Thr Leu 
360 365 370 

CTGGGACTGT CCCCAATGGG CTCCACAGCC CAGGGACTTG TCTCCTGAGG TAGGAGACAC 1399 

GCCTGGAGCC TTTGGGTATG CCCCAGCGCC CGCTACGTCT TGCGCAAGGC GGCCCATTGT 1459 

TTTGGGTGGA GTTCCCAAGT GTGGACACTC TTCCAGCAAA ATGGCAGACA AGCAACCTGA 1519 

GCCCCTGCAG CAGGAGTATA GGACCAACTT CCACCGGCTC AGAAAAGGGA GGGTCTCTGA 1579 

AACCAAGGTC TGAGCTGTGA CCCACATACA GGCACACAAT TTCACTGTGG ATGCCCCAAA 1639 
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CATGCTGCAC 


AGTCTCCAGT 


GTGGGTGAGG 


ACGTCACTGC 


AGACCTGTTA 


TGGCGACATG 


1699 


ACAGTCAAAC 


CAGCAAATAC 


CCCACCACCA 


CTGTCATCCT 


CCAAGATCTT 


GACTTTGGAT 


1759 


TTCAGAAAAA 


TGGGGAGGAC 


CAGGACCTGA 


GGGGCTTCAT 


AGAACTTGCT 


TGCAAGGGCA 


1819 


GCCCAGGTGT 


GTGTGTGTGT 


ATGTGCTTGT 


GTGTGTGTGT 


GTGTGTATGT 


GCATGTGTGT 


1879 


GTGTGTGTGT 


GTGTGTGTGT 


GTGTGTGTTG 


AAGAGGTGGA 


GACAGAGAGC 


AGAGAACCCC 


1939 


AAAGGATGTG 


GGTACAGGAT 


ATCTCACACC 


ATCATGCCAC 


AGGGAGACAG 


AGCTACAACG 


1999 


ATGGGTGCTT 


GGGGGTTCAG 


ATGGTGGCAG 


CTCTTTGCTT 


CAGTGTGTAT 


ACCCCCCTAG 


2059 


CACTGTACCC 


CCAGGCTGGA 


GGGGGGCTCT 


TCACAGAAGC 


TTCCAACCCA 


TTCCAGCACT 


2119 


GGCTGGAAGG 


GCTTGGGATG 


TCCCTGGAGA 


CTGGTCTGGG 


ACTATCTTCA 


AAGCGTCTTC 


2179 


CAAAGGCTGC 


TGCTGACCAG 


CTCACCTGAC 


TCCTCCTTGA 


ACACACTCCC 


CACGTGGGGC 


2239 


ATCACAGCCC 


CGATGGCAGC 


CCAGGATCAG 


GCCTCCCTCC 


CTTGACAGGG 


AAATGACTCA 


2299 


CCCAACCCCA 


TGCACTGTCA 


GTAACAAAGA 


CCCAGGCCCA 


GCTCCAAAAA 




2349 



(2) INFORMATION FOR SEQ ID NO J 50: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 372 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 50: 

Met Glu Tyr Glu Gly Tyr Asn Asp Ser Ser lie Tyr Gly Glu Glu Tyr 
1 5 10 - 15 

Ser Asp Gly Ser Asp Tyr lie Val Asp Leu Glu Glu Ala Gly Pro Leu 
20 25 30 

Glu Ala Lys Val Ala Glu Val Phe Leu Val Val He Tyr Ser Leu Val 
35 40 45 

Cys Phe Leu Gly He Leu Gly Asn Gly Leu Val He Val He Ala Thr 
50 55 60 

Phe Lys Met Lys Lys Thr Val Asn Thr Val Trp Phe Val Asn Leu Ala 
65 70 75 80 

Val Ala Asp Phe Leu Phe Asn He Phe Leu Pro He His He Thr Tyr 
85 90 95 

Ala Ala Met Asp Tyr His Trp Val Phe Gly Lys Ala Met Cys Lys He 
100 105 110 

Ser Ser Phe Leu Leu Ser His Asn Met Tyr Thr Ser Val Phe Leu Leu 
115 120 125 

Thr Val lie Ser Phe Asp Arg Cys He Ser Val Leu Leu Pro Val Trp 
130 135 140 
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Ser Gin Asn His Arg Ser Val Arg Leu Ala Tyr Met Thr Cys Val Val 
145 150 155 160 

Val Trp Val Leu Ala Phe Phe Leu Ser Ser Pro Ser Leu Val Phe Arg 
165 170 175 

Asp Thr Val Ser Thr Ser His Gly Lys He Thr Cys Phe Asn Asn Phe 
180 185 190 

Ser Leu Ala Ala Pro Glu Pro Phe Ser His Ser Thr His Pro Arg Thr 
195 200 205 

Asp Pro Val Gly Tyr Ser Arg His Val Ala Val Thr Val Thr Arg Phe 
210 215 220 

Leu Cys Gly Phe Leu He Pro Val Phe He He Thr Ala Cys Tyr Leu 
225 230 235 240 

Thr He Val Phe Lys Leu Gin Arg Asn Arg Leu Ala Lys Thr Lys Lys 
245 250 255 

Pro Phe Lys He He He Thr He He He Thr Phe Phe Leu Cys Trp 
260 265 270 

Cys Pro Tyr His Thr Leu Tyr Leu Leu Glu Leu His His Thr Ala Val 
275 280 285 

Pro Ala Ser Val Phe Ser Leu Gly Leu Pro Leu Ala Thr Ala Val Ala 
290 295 300 

He Ala Asn Ser Cys Met Asn Pro He Leu Tyr Val Phe Met Gly His 
305 310 315 320 

Asp Phe Lys Lys Phe Lys Val Ala Leu Phe Ser Arg Leu Val Asn Ala 
325 330 335 

Leu Ser Glu Asp Thr Gly Pro Ser Ser Tyr Pro Ser His Arg Ser Phe 
340 345 350 

Thr Lys Met Ser Ser Leu He Glu Lys Ala Ser Val Asn Glu Lys Glu 
355 360 365 

Thr Ser Thr Leu 
370 

<2) INFORMATION FOR SEQ ID NO: 51: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2577 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 

(vii) IMMEDIATE SOURCE: 
(B) CLONE: HW056 

(ix) FEATURE: 
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(A) NAME/KEY: CDS 

(B) LOCATION: 8.. 523 

(ix) FEATURE: 

(A) NAME/ KEY : misc_f eature 

(B) LOCATION: complement (738.. 953) 

(D) OTHER INFORMATION: /label= SAC_24326 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 51: 

GTCGACG GGC TAC CTT AGC CAG CTG ATG CTT CCA GTG ACA GAG CCA CTT 49 
Gly Tyr Leu Ser Gin Leu Met Leu Pro Val Thr Glu Pro Leu 
1 5 10 

TGT CCT TTG CAC AGC CTG ACA CCC TAT CAG GTC CCC TTC AAT GCA GTT 97 
Cys Pro Leu His Ser Leu Thr Pro Tyr Gin Val Pro Phe Asn Ala Val 
15 20 25 30 

GCT ATC CGG GTC ACG CAC GCC GAC GTT GCA CCT ACC CAC ATA CTG TAC 145 
Ala He Arg Val Thr His Ala Asp Val Ala Pro Thr His He Leu Tyr 
35 40 45 

GCT ATG AAT GCC AGC TGG GTC GGC CTT TGC AAG ATT GTG GAT GAT ATG 193 
Ala Met Asn Ala Ser Trp Val Gly Leu Cys Lys He Val Asp Asp Met 
50 55 60 

AAA GGC TAC ACT CGG GGC CCC ATT CTG CTG GCC CAG AAC CCC ATA TGT 241 
Lys Gly Tyr Thr Arg Gly Pro He Leu Leu Ala Gin Asn Pro He Cys 
65 70 75 

GAC TGT TTG GGC TTT GGT ATC TGC AGA GGC ATT GAC ATG GAC AAG CGG 289 
Asp Cys Leu Gly Phe Gly He Cys Arg Gly He Asp Met Asp Lys Arg 
80 85 90 

CTG TAC CAC ATC CTC ACC CCT CTA CCA CCA GAG GAG TTA AGA ACT GTG 337 
Leu Tyr His He Leu Thr Pro Leu Pro Pro Glu Glu Leu Arg Thr Val 
95 100 105 HO 

AAC TGT CTG CTG GTT GGC ACC ATT TCC ATT CCA CAT TGT ATT TTC AAG 385 
Asn Cys Leu Leu Val Gly Thr He Ser He Pro His Cys He Phe Lys 
115 120 * 125 

AAT CAG CCT GGG ACT GAA GGG ACA GTT CCT TAT GTC ACC AGG GAT TAT 433 
Asn Gin Pro Gly Thr Glu Gly Thr Val Pro Tyr Val Thr Arg Asp Tyr 
130 135 140 

AAT TTA AAA CTT CTT GGA GCA TCA GAG AAA ATT GGG GAG AGA GAG TAT 481 
Asn Leu Lys Leu Leu Gly Ala Ser Glu Lys He Gly Glu Arg Glu Tyr 
145 150 155 

AGA AAT ATA CTA CCT AGA CAC AAA TCA CGG CAA AGA AGA AAA 523 
Arg Asn He Leu Pro Arg His Lys Ser Arg Gin Arg Arg Lys 
160 165 ** 170 



TGAAGCCTGG 


GTGTACAGAG 


AGACTTTCGC 


CTCGACACAT 


TGTCCCCAAG 


CCTAGAGTCA 


583 


GCAGACTGAG 


CGGGCTGTGT 


GGCTGCTGAC 


CTCGTGCCCA 


TGCCCAGTGT 


TCATATAACT 


643 


GGAAAATGTG 


CTGTGGTAAG 


AGTCATTCAA 


GCCCCTTCGT 


TTTTTTCCCC 


TCTAAGACAT 


703 


GGAGGGCCTC 


ATATAGCTCA 


GACTGGCCTC 


AGTACTCTGG 


TCCTCCTGTC 


TCACTGTCCA 


763 
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AGGGCTGCGC 


ATCCGGCTTT 


TGGTCTCATA 


CTTGGCCCAG 


ACCTTGCCGA 


CTGGTTGTCC 


823 


TGGGTGTGTG 


GTGCTCCTGG 


AGAGCTGGGA 


AGCATTGAAG 


AAAGCTGTGG 


TGTCCTCCAA 


883 


GAAGACCAGG 


GCCACAAAAG 


TGTGCCTAGA 


AACTGAGTAG 


CAGGCTGACC 


ACAGGTTCTC 


943 


GGGTGCAGTG 


TACGGACTGG 


CAGGGACCTG 


AGCAGGCTCA 


GTGGCCGTGA 


GTAAGAGGCT 


1003 


TACTCCGTTG 


AACCTTGTGG 


ACCTTAGCAC 


CCCAGTGTGG 


CGGCTCATTG 


GCCTGATCCT 


1063 


TTGATGCTTG 


CCATCGAATC 


TCCCAATGCT 


GCAATCAAGT 


GTTAGAGGAC 


GGTACGTCGT 


1123 


CTTGTTAATT 


TCTGGGTCTT 


TAATCATGAG 


GCTTTATGTT 


GAAGAACTTG 


CTCTGACATT 


1183 


CAGAAAGCCT 


CACGTTAGTT AAAGAAATCC 


ATGGTTCCCT 


AATTCTGGGG 


ATGTTCTGAC 


1243 


AGACCTTGAG 


/-i r+trrr* rp TV O T\ T» A 

GCTGTAGAT. A 


Lu a\_AvjAAAvj 


CCTCCATGTT 


AGTTAAGGAA 


ATCCATGGTT 


1303 


CCCTAATTCT 


GGGGATG A rC 


1\jAv_ AviGGAA 


TGTTCTGACA 


GACCTTGAGG 


CTGAGATACT 


1363 


ACCATGGGCC 


LACTAGC.C.TG 


GGACAGGCAL. 


ACATATGCCT 


GGCTTGGGCA 


GACCTTTGCT 


1423 


ATCAACGATT 


TCTTCTAAGG 


L. vU L LAGAGGC 


CACTGCAAGT 


GGATGTTGCA ATCCGGAGTG 


1483 


ACTGGGTCGG 


GTCATTTTTG 


Ivjvj A AoL. AAA 


GCAGGACCCT 


GAGGAGCCTC 


TTGGGCTGGC 


1543 


ATGATGGATC 


JAiAL. AAlj AC 1 




TGGAATCAGG 


AAGGACCAAG 


GCTATGGAAG 


1603 


GCTCACAACA 


TTGATTGTCT 


CLb Av- 1 I\j A\_ 


TGTAGTCATA 


GGAGTGTGGC 


CAGTCTCCTC 


1663 


TGCTTGGCGC 


tnlAy, X 1 Avj I Vj 


PTPlTTTTVlfi 
Vj A VjVjt-L. 


CATCTGAAAA 


TCCTCCGAGG AGGCAGAGCT 


1723 


GCCTCTTCTG 


un 1 iAu 1 v_vj»Vj 


a A ivLw Av9 AA 


GATCACTCGG 


GCTCACAACT 


ACTCCCTGCC 


1783 


CCACCCCCAA 


l»AV«l>A AA-. A \- X 


\3 1 v# A unv« i i o 


AGCTTGTGGT 


GAGCCTTTGC 


AGAGGGGAGG 


1843 


AGCTCTTCCT 


GCCCAGGCAC 


TCTCTGGGCT 


CCTGATCCTC 


AAGCCTCTGC 


TTCCTGGGTG 


1903 


CTCAGATCAC 


AGGTGTGTGC 


CGTCTTAGCC 


TTGCGGACAG 


TTTTAAGACA GCCTTGGTCA 


1963 


CCTTCATCTC 


CCTCCCGGGG 


TCTTCCTGTC 


TTCACAATAA 


AGATCCCGAG 


TCCAAAAAAA 


2023 


AAAAAAAAAA 


AAATTCCAGG 


AGGAAATCTG 


GGTGCACAGA 


ACTTATCTGA 


GTCGGCTTTG 


2083 


GTGAAGGAGA 


TCCTCAAGGA 


GCAGGAGAAC 


AGGAAGGGCC 


TCATAGCTGC 


CATCTGTGCG 


2143 


GGTCCTACGG 


CCCTGCTGGC 


TCACGAAGTA 


GGCTTTGGAT 


GCAAGGTTAC 


ATCGCACCCA 


2203 


TTGGCTAAGG 


ACAAAATGAT 


GAACGGCAGT 


CACTACAGCT 


ACTCAGAGAG 


CCGTGTGGAG 


2263 


AAGGACGGCC 


TCATCCTCAC 


CAGCCGTGGG 


CCTGGGACCA 


GCTTCGAGTT 


TGCGCTGGCC 


2323 


ATTGTGGAGG 


CACTCAGTGG 


CAAGGACATG 


GCTAACCAAG 


TGAAGGCCCC 


GCTTGTTCTC 


2383 


AAAGACTAGA 


GCCCAAGCCC 


TGGACCCTGG 


ACCCCCAGGC 


TGAGCAGGCA TTGGAAGCCC 


2443 


ACTAGTGTGT 


CCACAGCCCA 


GTGAACCTCA 


GGAACTAACG 


TGTGAAGTAG 


CCCGCTGCTC 


2503 


AGGAATCTCG 


CCCTGGCTCT 


GTACTATTCT 


GAGCCTTGCT AGTAGAATAA ACAGTTCCCC 


2563 


AAGCTCCTGA AAAA 










2577 
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(2) INFORMATION FOR SEQ ID NO: 52: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 172 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 52: 

Gly Tyr Leu Ser Gin Leu Met Leu Pro Val Thr Glu Pro Leu Cys Pro 
15 10 15 

Leu His Ser Leu Thr Pro Tyr Gin Val Pro Phe Asn Ala Val Ala lie 
20 25 30 

Arg Val Thr His Ala Asp Val Ala Pro Thr His lie Leu Tyr Ala Met 
35 40 45 

Asn Ala Ser Trp Val Gly Leu Cys Lys He Val Asp Asp Met Lys Gly 
50 55 60 

Tyr Thr Arg Gly Pro He Leu Leu Ala Gin Asn Pro He Cys Asp Cys 
65 70 75 80 

Leu Gly Phe Gly He Cys Arg Gly He Asp Met Asp Lys Arg Leu Tyr 
85 90 95 

His He Leu Thr Pro Leu Pro Pro Glu Glu Leu Arg Thr Val Asn Cys 
100 105 110 

Leu Leu Val Gly Thr He Ser He Pro His Cys He Phe Lys Asn Gin 
115 120 125 

Pro Gly Thr Glu Gly Thr Val Pro Tyr Val Thr Arg Asp Tyr Asn Leu 
130 135 140 

Lys Leu Leu Gly Ala Ser Glu Lys He Gly Glu Arg Glu Tyr Arg Asn 
145 150 155 160 

He Leu Pro Arg His Lys Ser Arg Gin Arg Arg Lys 
165 170 

(2) INFORMATION FOR SEQ ID NO: 53: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1339 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 

(vii) IMMEDIATE SOURCE: 
(B) CLONE: HW057 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 7. .1023 
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(ix) FEATURE: 

(A) NAME/KEY: misc_f eature 

(B) LOCATION: 1032.. 1176 

(D) OTHER INFORMATION: /label* SAC_23926 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 53: 

GTCGAC GGC GCG CTG GAG GAC GCA CGG CTT GGC GGG AAG GCG AGG GGA 48 
Gly Ala Leu Glu Asp Ala Arg Leu Gly Gly Lys Ala Arg Gly 
15 10 

CCG TGC GGA GCC TCC GGT GCT GGC GGC GGC CAC CAG TTC GGG AGC CGG 96 
Pro Cys Gly Ala Ser Gly Ala Gly Gly Gly His Gin Phe Gly Ser Arg 
15 20 25 30 

GAG CCG GCC GCA GCA CGC GGA ACA GAG CAG AGG GTG GCG GGG CCC GGC 144 
Glu Pro Ala Ala Ala Arg Gly Thr Glu Gin Arg Val Ala Gly Pro Gly 
35 40 45 

GAA ACC ACG ATG AAG CCG AAC AGC GAG GAA GAG GAG GAG TTG GTG CAG 192 
Glu Thr Thr Met Lys Pro Asn Ser Glu Glu Glu Glu Glu Leu Val Gin 
50 55 60 

GGC GTG GGC CCC TGG GAC GAG TGC TTC GAG GTG GCC GTG CAG TTG GCG 240 
Gly Val Gly Pro Trp Asp Glu Cys Phe Glu Val Ala Val Gin Leu Ala 
65 70 75 

TTG CGT GCG GGA CAA ATC ATC AGA AAG GCC CTC ACT GAG GAA AAA CAC 288 
Leu Arg Ala Gly Gin lie lie Arg Lys Ala Leu Thr Glu Glu Lys His 
80 85 90 

GTC TCG ACG AAA ACA TCT GCT GCA GAT CTT GTG ACA GAA ACA GAT CAC 336 
Val Ser Thr Lys Thr Ser Ala Ala Asp Leu Val Thr Glu Thr Asp His 
95 100 105 110 

CGA GTA GAA GAC TTA ATT GTT TCT GAG TTG CGA AAG CGG TTC CCT TCA 384 
Arg Val Glu Asp Leu lie Val Ser Glu Leu Arg Lys Arg Phe Pro Ser 
115 120 125 

CAC AGG TTC ATT GCA GAA GAG GCC ACA GCC TCC GGG GCC AAG TGT GTG 432 
His Arg Phe lie Ala Glu Glu Ala Thr Ala Ser Gly Ala Lys Cys Val 
130 135 140 

CTC ACC CAC AGC CCG ACC TGG ATC ATC GAC CCC ATC GAC GGC ACC TGC 480 
Leu Thr His Ser Pro Thr Trp He He Asp Pro He Asp Gly Thr Cys 
145 150 155 

AAC TTT GTG CAC AGA TTC CCC ACT GTG GCA GTT AGC ATC GGA TTT GCT 528 
Asn Phe Val His Arg Phe Pro Thr Val Ala Val Ser He Gly Phe Ala 
160 165 170 

GTT CAC CAG GAG CTG GAA TTC GGA GTG ATT CAC CAC TGC ACA GAG GAG 576 
Val His Gin Glu Leu Glu Phe Gly Val He His His Cys Thr Glu Glu 
175 180 185 190 

CGG CTG TAC ACC GGC AGG AGG GGC CAG GGC GCC TTT TGC AAT GGC CAG 624 
Arg Leu Tyr Thr Gly Arg Arg Gly Gin Gly Ala Phe Cys Asn Gly Gin 
195 200 205 

AGG CTC CAG GTC TCC AGG GAG ACA GAT CTC GCA AAG GCC TTG GTT CTG 672 
Arg Leu Gin Val Ser Arg Glu Thr Asp Leu Ala Lys Ala Leu Val Leu 
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210 215 220 

ACA GAA ATC GGG CCC AAA CGT GAC CCC GAT ACT CTG AAA GTA TTC CTG 720 
Thr Glu He Gly Pro Lys Arg Asp Pro Asp Thr Leu Lys Val Phe Leu 
225 230 235 

AGC AAC ATG GAG CGG CTG CTG CAC GCC AAG GCT CAT GGG GTC CGA GTG 768 
Ser Asn Met Glu Arg Leu Leu His Ala Lys Ala His Gly Val Arg Val 
240 245 250 

ATT GGC AGC TCC ACC TTG GCG CTC TGC TAC TTG GCC TCG GGG GCT GCT 816 
He Gly Ser Ser Thr Leu Ala Leu Cys Tyr Leu Ala Ser Gly Ala Ala 
255 260 265 270 

GAT GCC TAT TAC CAG TTC GGC CTC CAC TGC TGG GAT CTG GCA GCT GCC 864 
Asp Ala Tyr Tyr Gin Phe Gly Leu His Cys Trp Asp Leu Ala Ala Ala 
275 280 285 

ACA GTC ATC ATC AGA GAA GCA GGT GGC ATT GTG ATT GAC ACC TCA GGT 912 
Thr Val He He Arg Glu Ala Gly Gly He Val He Asp Thr Ser Gly 
290 295 300 

GGA CCC CTT GAC CTC ATG TCG TGC AGA GTG GTG GCT GCT GGC ACC AGA 960 
Gly Pro Leu Asp Leu Met Ser Cys Arg Val Val Ala Ala Gly Thr Arg 
305 310 315 

GAG ATG GCA GTG CTC ATA GCT CAG GCC CTA CAA ACC ATT AAC TAC GGC 1008 
Glu Met Ala Val Leu He Ala Gin Ala Leu Gin Thr He Asn Tyr Glv 
320 325 330 

CGG GAC GAT GAG AAG TGAGCCGTAC AGAGCTCTAA GGCTGACATG AGCAGCTCCC 1063 

Arg Asp Asp Glu Lys 

335 



TGGGAAAGAG 


CTGTCCGGGG 


GCTTGAGTTC 


CGGGATAGTC 


TACCATAGCT 


GTCCCCGGAC 


1123 


CTCGGTGCTT 


AGCTGATCCT 


CTCTAATCTC 


GGGTAGCCCC 


TTTCCAGGTC 


GGTACATGGT 


1183 


CTTTCATCAG 


AGCCAAACCC 


AAATCTTGTG 


AGGTGTGTTA 


GTCACCCATC 


CTGGTTGTTC 


1243 


GGAATGCAAA 


TCTCAGGTAA 


TAAAGCTTTA 


GAACGAGCTC 


TCAGGCCCTC 


CCCTGCCCGT 


1303 


GGTGATACAG 


AATGCAATAA 


ATCAGAACTT TAAAAA 






1339 



(2) INFORMATION FOR SEQ ID NO: 54: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 339 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 54: 

Gly Ala Leu Glu Asp Ala Arg Leu Gly Gly Lys Ala Arg Gly Pro Cys 
1 5 10 " is 

Gly Ala Ser Gly Ala Gly Gly Gly His Gin Phe Gly Ser Arg Glu Pro 
20 25 30 

Ala Ala Ala Arg Gly Thr Glu Gin Arg Val Ala Gly Pro Gly Glu Thr 
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35 40 45 

Thr Met Lys Pro Asn Ser Glu Glu Glu Glu Glu Leu Val Gin Gly Val 
50 55 60 

Gly Pro Trp Asp Glu Cys Phe Glu Val Ala Val Gin Leu Ala Leu Arg 
65 70 75 80 

Ala Gly Gin lie lie Arg Lys Ala Leu Thr Glu Glu Lys His Val Ser 
85 90 95 

Thr Lys Thr Ser Ala Ala Asp Leu Val Thr Glu Thr Asp His Arg Val 
100 105 110 

Glu Asp Leu He Val Ser Glu Leu Arg Lys Arg Phe Pro Ser His Arg 
115 120 125 

Phe He Ala Glu Glu Ala Thr Ala Ser Gly Ala Lys Cys Val Leu Thr 
130 135 140 

His Ser Pro Thr Trp He He Asp Pro He Asp Gly Thr Cys Asn Phe 
145 150 155 ~ 160 

Val His Arg Phe Pro Thr Val Ala Val Ser He Gly Phe Ala Val His 
165 170 175 

Gin Glu Leu Glu Phe Gly Val He His His Cys Thr Glu Glu Arg Leu 
180 185 190 

Tyr Thr Gly Arg Arg Gly Gin Gly Ala Phe Cys Asn Gly Gin Arg Leu 
195 200 205 

Gin Val Ser Arg Glu Thr Asp Leu Ala Lys Ala Leu Val Leu Thr Glu 
210 215 220 

He Gly Pro Lys Arg Asp Pro Asp Thr Leu Lys Val Phe Leu Ser Asn 
225 230 235 240 

Met Glu Arg Leu Leu His Ala Lys Ala His Gly Val Arg Val He Gly 
245 250 255 

Ser Ser Thr Leu Ala Leu Cys Tyr Leu Ala Ser Gly Ala Ala Asp Ala 
260 265 270 

Tyr Tyr Gin Phe Gly Leu His Cys Trp Asp Leu Ala Ala Ala Thr Val 
275 280 285 

He He Arg Glu Ala Gly Gly He Val He Asp Thr Ser Gly Gly Pro 
290 295 300 

Leu Asp Leu Met Ser Cys Arg Val Val Ala Ala Gly Thr Arg Glu Met 
305 310 315 320 

Ala Val Leu He Ala Gin Ala Leu Gin Thr He Asn Tyr Gly Arg Asp 
325 330 335 

Asp Glu Lys 

(2) INFORMATION FOR SEQ ID NO: 55: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2285 base pairs 
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(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 



(vii) IMMEDIATE SOURCE: 
(B) CLONE: HW059 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

<B) LOCATION: 9. .1271 

(ix) FEATURE: 

(A) NAME/ KEY: misc_f eature 

(B) LOCATION: 1991.. 2162 

(D) OTHER INFORMATION: /label= SAC_24457 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 55: 

GTCGACAG CCC GCC ACT CCA GGC GCG ATG CTG TTC TGG ACT GTG CTC AGC 50 
Pro Ala Thr Pro Gly Ala Met Leu Phe Trp Thr Val Leu Ser 
15 10 

ATG GCT TTG AGT CTG CGG TTG GCA CTG GCG CAG AGC GGC ATA GAG CGC 98 
Met Ala Leu Ser Leu Arg Leu Ala Leu Ala Gin Ser Gly lie Glu Arg 
15 20 25 30 

GGT CCC ACA GCA TCA GCC CCC CAG GGG GAC CTG TTG TTC CTG CTG GAC 146 
Gly Pro Thr Ala Ser Ala Pro Gin Gly Asp Leu Leu Phe Leu Leu Asp 
35 40 45 

AGC TCA GCC AGT GTG TCA CAC TAT GAG TTC TCA AGA GTT CGA GAG TTT 194 
Ser Ser Ala Ser Val Ser His Tyr Glu Phe Ser Arg Val Arg Glu Phe 
50 55 60 

GTG GGG CAG CTG GTG GCT ACG ATG CCT TTC GGA CCC GGG GCT CTG CGT 242 
Val Gly Gin Leu Val Ala Thr Met Pro Phe Gly Pro Gly Ala Leu Arg 
65 70 75 

GCT AGT CTG GTG CAC GTG GGC AGC CGG CCT CAC ACG GAG TTT ACT TTT 290 
Ala Ser Leu Val His Val Gly Ser Arg Pro His Thr Glu Phe Thr Phe 
80 85 90 

GAT CAG TAC AGT TCA GGC CAG GCC ATA CAG GAT GCT GTA CGT GTT GCG 338 
Asp Gin Tyr Ser Ser Gly Gin Ala He Gin Asp Ala Val Arg Val Ala 
95 100 105 110 

CCC CAA CGT ATG GGT GAC ACC AAC ACG GGC CTG GCA CTG GCT TAT GCC 386 
Pro Gin Arg Met Gly Asp Thr Asn Thr Gly Leu Ala Leu Ala Tyr Ala 
115 120 125 

AAA GAA CAA TTA TTT GCT GAG GAA GCA GGT GCC CGG CTA GGG GTT CCC 434 
Lys Glu Gin Leu Phe Ala Glu Glu Ala Gly Ala Arg Leu Gly Val Pro 
130 135 140 

AAG GTA CTG GTG TGG GTG ACA GAT GGG GCC TCC AGT GAC TCT GTG GGC 482 
Lys Val Leu Val Trp Val Thr Asp Gly Ala Ser Ser Asp Ser Val Gly 
145 150 155 
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CCC CCT ATG CAG GAG CTC AAG GAC CTG GGT GTC ACC ATC TTC ATT GTC 530 
Pro Pro Met Gin Glu Leu Lys Asp Leu Gly Val Thr He Phe He Val 
160 165 170 

AGC ACT GGC CGA GGC AAC CTG TTG GAG CTG TTG GCA GCT GCC TCA GCT 578 
Ser Thr Gly Arg Gly Asn Leu Leu Glu Leu Leu Ala Ala Ala Ser Ala 
175 180 185 190 

CCG GCT GAG AAG CAC CTA CAC TTT GTG GAT GTG GAT GAC CTT CCT ATC 626 
Pro Ala Glu Lys His Leu His Phe Val Asp Val Asp Asp Leu Pro He 
195 200 205 

ATT GCC CGG GAG CTT CGG GGT GCC ATT ATT GAT GCG ATG CAG CCA CAT 674 
He Ala Arg Glu Leu Arg Gly Ala He He Asp Ala Met Gin Pro His 
210 215 220 

CAG CTT CAT GCT TCG GAG ATT CTG TCC AAT GGC TTC CGC CTG TCC TGG 722 
Gin Leu His Ala Ser Glu lie Leu Ser Asn Gly Phe Arg Leu Ser Trp 
225 230 235 

CCG CCC CTG CTG ACA GCG GAC TCT GGT TAC TAC GTG CTG GAG TTG GTG 770 
Pro Pro Leu Leu Thr Ala Asp Ser Gly Tyr Tyr Val Leu Glu Leu Val 
240 245 250 

CCC AGT GGC AAA CTG GCA GCC ACA AGA CGC CAA CAG CTG CCC GGG AAT 818 
Pro Ser Gly Lys Leu Ala Ala Thr Arg Arg Gin Gin Leu Pro Gly Asn 
255 260 265 270 

££ C o GC ^ ACC 160 ACC GAC CTC AAC CCA GAC ACA GAT TAC GAA 866 
Ala Thr Ser Trp Thr Trp Thr Asp Leu Asn Pro Asp Thr Asp Tyr Glu 
275 280 285 

OTA TCG CTG TTG CCG GAG TCC AAT GTG CGC CTC CTG AGG CCC CAG CAC 914 
Val Ser Leu Leu Pro Glu Ser Asn Val Arg Leu Leu Arg Pro Gin His 
290 295 300 

TTG CGA GTA CGC ACA CTG CAA GAG GAG GCA GGG CCA GAA CGC ATC GTC 962 
Leu Arg Val Arg Thr Leu Gin Glu Glu Ala Gly Pro Glu Arg He Val 
305 310 315 

ATC TCG CAT ACT AGG CCG CGC AGC CTC CGT GTA AGT TGG GCC CCC GCA 1010 
He Ser His Thr Arg Pro Arg Ser Leu Arg Val Ser Trp Ala Pro Ala 
320 325 330 

CTT GGC CCG GAC TCC ACT CTC GGC TAC CTT GTA CAG CTC GGA CCT CTG 1058 
Leu Gly Pro Asp Ser Thr Leu Gly Tyr Leu Val Gin Leu Gly Pro Leu 
335 340 345 350 

CAG GGC GGA TCC CTA GAA CAC GTG GAG GTG CCA GCT GGC CAG AAC AGC 1106 
Gin Gly Gly Ser Leu Glu His Val Glu Val Pro Ala Gly Gin Asn Ser 
355 360 365 

ACT ACC ATC CAG GGC CTG ACG CCC TGC ACC ACT TAC CTG GTG ACT GTG 1154 
Thr Thr lie Gin Gly Leu Thr Pro Cys Thr Thr Tyr Leu Val Thr Val 
370 375 380 

ACT GCC GCC TTC CGC TCG GGC CGC CAG AGG GCG CTG TCG GCT AAG GCC 1202 
Thr Ala Ala Phe Arg Ser Gly Arg Gin Arg Ala Leu Ser Ala Lys Ala 
385 390 395 

TGT ACG GCA TCT GGC GAG CGG ATC CGT GTC CCG CAG GCC ATG CGG CCG 1250 
Cys Thr Ala Ser Gly Glu Arg He Arg Val Pro Gin Ala Met Arg Pro 
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400 405 410 



GAG GCC GGA CTG CGG GAG CCC TGACCTTCCT GCCTCGTCCA CCCGAGGGGC 1301 
Glu Ala Gly Leu Arg Glu Pro 
415 420 



CCTCTTCCCT 


AATCCAGTGA 


GAGAGGCACC 


GCTGCTCGTG 


GTTTTTCTTG 


TGGATGGAGT 


1361 


GGGGTGGGGA 


GATGGGGGTG 


CTGGTCCTAC 


CTTTGACCTG 


CGTAACTCCT 


CCGGTCGTTT 


1421 


CCCCACTGGT 


CATCACCGCC 


CTTGCCTGAC 


TCCCGGGAAA 


CCTGTAGCAA 


CCCTGGTAGC 


1481 


CTCACGCGCA 


ATGACAATCC 


TCTCCGGTTG 


CCAGTGGAGT 


TGAGCACACG 


GTGGTCCTTG 


1541 


GGCAACATTT 


GGCGAGGGGA 


TGGACAGTGT 


TTGAGGTCAG 


GTTGAGACCC 


AGGAGAAGCA 


1601 


TTCAGGAGAG 


GAGGCCACAG 


AGTTTTCTAC 


CTGTGCCAAA 


GACTGGGCCC 


TCTGGTGGCA 


1661 


AGGACTACAC 


ATGGCTTGGA 


GGAAATGTCC 


AGGACCCTTC 


AAGTCCTGCC 


TGTGCCTAGA 


1721 


AAGTGGGTAG 


GAGAAAGGGA 


GGAGAGAGTA 


GTGTAGGCAA 


GGTTCCCAAA GACTTCCTTA 


1781 


GGGCAAGGAA 


AGGTAGGGAG 


AGATACCGGG 


AGGCTGATGA 


TGTTGCCAAC 


TAGTTTTCAT 


1841 


CAAGATTTTC 


TGCCAGCCTG 


GAGGCCAGGA 


TCTGTCAGGG 


TCACTGACTC 


TGCCTTCCTG 


1901 


CCCAGGACCT 


GCACTGGGCC 


CCGCTATCAG 


TGCGGGGGGG 


GGGGTGCAGA GTCTTCACAG 


1961 


GAATGGGGGA 


TGAGACCTTG 


GCATGTAGTA 


CATTGGGGAT 


AGGAGAGCCC 


TGCCGTGACA 


2021 


GACTTACAGG 


GAGTCTCCTG 


CTTTAGTGTA 


GGGAGCAAGG 


TGACATGCAG 


GTGGGCTACC 


2081 


TCCTGTCATC 


ACTACTGCCC 


TGGGGCATCT 


GACAGATACC 


TAAGGGTAGT 


CAGGAACAGG 


2141 


CTTCCTCTCC 


AGTCCCTATG 


TACGCAGAGC 


CCCTCCTCCC 


CAGAACCTCT 


TGCCCTAACC 


2201 


TAAGCTTACT 


CCATCTCTCT 


TCCCCACTAA 


TGACCCAGAC 


TCTAACAATA 


ATACAGTAAG 


2261 


CCAGATGTAA 


ACTGTGAAGT 


CGAC 








2285 



(2) INFORMATION FOR SEQ ID NO: 56: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 421 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 56: 

Pro Ala Thr Pro Gly Ala Met Leu Phe Trp Thr Val Leu Ser Met Ala 
15 10 15 

Leu Ser Leu Arg Leu Ala Leu Ala Gin Ser Gly He Glu Arg Gly Pro 
20 25 30 

Thr Ala Ser Ala Pro Gin Gly Asp Leu Leu Phe Leu Leu Asp Ser Ser 
35 40 45 

Ala Ser Val Ser His Tyr Glu Phe Ser Arg Val Arg Glu Phe Val Gly 
50 55 60 
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Gin Leu Val Ala Thr Met Pro Phe Gly Pro Gly Ala Leu Arg Ala Ser 
65 70 75 80 

Leu Val His Val Gly Ser Arg Pro His Thr Glu Phe Thr Phe Asp Gin 
85 go 95 

Tyr Ser Ser Gly Gin Ala He Gin Asp Ala Val Arg Val Ala Pro Gin 
100 105 no 

Arg Met Gly Asp Thr Asn Thr Gly Leu Ala Leu Ala Tyr Ala Lys Glu 
115 120 125 

Gin Leu Phe Ala Glu Glu Ala Gly Ala Arg Leu Gly Val Pro Lys Val 



135 



140 



Leu Val Trp Val Thr Asp Gly Ala Ser Ser Asp Ser Val Gly Pro Pro 

145 1 c n 



150 



155 



160 



Met Gin Glu Leu Lys Asp Leu Gly Val Thr He Phe He Val Ser Thr 
165 170 175 

Gly Arg Gly Asn Leu Leu Glu Leu Leu Ala Ala Ala Ser Ala Pro Ala 
180 185 190 

Glu Lys His Leu His Phe Val Asp Val Asp Asp Leu Pro He He Ala 
195 200 205 

Arg Glu Leu Arg Gly Ala He He Asp Ala Met Gin Pro His Gin Leu 
210 215 220 

His Ala Ser Glu He Leu Ser Asn Gly Phe Arg Leu Ser Trp Pro Pro 



230 



235 



240 



Leu Leu Thr Ala Asp Ser Gly Tyr Tyr Val Leu Glu Leu Val Pro Ser 
245 250 255 

Gly Lys Leu Ala Ala Thr Arg Arg Gin Gin Leu Pro Gly Asn Ala Thr 
260 265 270 

Ser Trp Thr Trp Thr Asp Leu Asn Pro Asp Thr Asp Tyr Glu Val Ser 
275 280 285 

Leu Leu Pro Glu Ser Asn Val Arg Leu Leu Arg Pro Gin His Leu Arg 
290 295 300 

Val Arg Thr Leu Gin Glu Glu Ala Gly Pro Glu Arg He Val He Ser 



310 



315 



320 



His Thr Arg Pro Arg Ser Leu Arg Val Ser Trp Ala Pro Ala Leu Gly 
325 330 335 

Pro Asp Ser Thr Leu Gly Tyr Leu Val Gin Leu Gly Pro Leu Gin Gly 
340 345 350 

Gly Ser Leu Glu His Val Glu Val Pro Ala Gly Gin Asn Ser Thr Thr 
355 360 355 

He Gin Gly Leu Thr Pro Cys Thr Thr Tyr Leu Val Thr Val Thr Ala 
370 375 380 

Ala Phe Arg Ser Gly Arg Gin Arg Ala Leu Ser Ala Lys Ala Cys Thr 



390 



395 



400 
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Ala Ser Gly Glu Arg lie Arg Val Pro Gin Ala Met Arg Pro Glu Ala 
405 410 415 

Gly Leu Arg Glu Pro 
420 

(2) INFORMATION FOR SEQ ID NO: 57: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 2521 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 



(vii) IMMEDIATE SOURCE: 
(B) CLONE: HW061 

(ix) FEATURE: 

(A) NAME/ KEY: CDS 

(B) LOCATION: 7.. 1422 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

<B) LOCATION: complement (2089.. 2287) 

(D) OTHER INFORMATION: /label= SAC_24029 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 57: 

GTCGAC GCG ACG GTG GCG GAG CAG TGG GTG CTG GTG GAG ATG GTG CAG 48 
Ala Thr Val Ala Glu Gin Trp Val Leu Val Glu Met Val Gin 
15 10 

GCG CTG TAC GAG GCT CCA GCA TAC CAT CTA ATT CTG GAA GGA ATC CTC 96 
Ala Leu Tyr Glu Ala Pro Ala Tyr His Leu He Leu Glu Gly He Leu 
15 20 25 30 

ATA CTT TGG ATA ATC AGA CTC GTT TTC TCT AAA ACT TAC AAG TTG CAG 144 
He Leu Trp He He Arg Leu Val Phe Ser Lys Thr Tyr Lys Leu Gin 
35 40 45 

GAG CGT TCT GAC CTT ACA GCC AAG GAA AAG GAA GAA CTG ATT GAA GAA 192 
Glu Arg Ser Asp Leu Thr Ala Lys Glu Lys Glu Glu Leu He Glu Glu 
50 55 60 

TGG CAG CCA GAG CCC CTC GTC CCC CCG GTC TCC AGG AAC CAT CCT GCT 240 
Trp Gin Pro Glu Pro Leu Val Pro Pro Val Ser Arg Asn His Pro Ala 
65 70 75 

CTC AAC TAC AAC ATC GTC TCC GGC CCT CCA ACC CAT AAC ATC GTG GTG 288 
Leu Asn Tyr Asn He Val Ser Gly Pro Pro Thr His Asn He Val Val 
80 85 90 

AAC GGA AAA GAG TGT GTC AAC TTT GCC TCC TTT AAT TTC CTC GGG TTG 336 
Asn Gly Lys Glu Cys Val Asn Phe Ala Ser Phe Asn Phe Leu Gly Leu 
95 100 105 HO 
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CTG GCC AAC CCT CGG GTT AAG GCT GCG GCT TTT GCA TCA TTA AAG AAG 384 
Leu Ala Asn Pro Arg Val Lys Ala Ala Ala Phe Ala Ser Leu Lys Lys 
115 120 125 

TAT GGC GTG GGT ACC TGT GGT CCC AGA GGG TTT TAT GGC ACA TTT GAT 432 
Tyr Gly Val Gly Thr Cys Gly Pro Arg Gly Phe Tyr Gly Thr Phe Asp 
130 135 140 

GTC CAT CTG GAT TTG GAA GAG CGC CTG GCA AAG TTT ATG AAG ACC GAA 480 
Val His Leu Asp Leu Glu Glu Arg Leu Ala Lys Phe Met Lys Thr Glu 
145 150 ~ 155 

GAA GCT ATC ATT TAC TCC TAT GGC TTC TCC ACC ATA GCC AGT GCG ATT 528 
Glu Ala He He Tyr Ser Tyr Gly Phe Ser Thr He Ala Ser Ala lie 
160 165 170 

CCT GCG TAC TCT AAG AGA GGG GAC ATT GTT TTT GTG GAC AGT GCA GCC 576 
Pro Ala Tyr Ser Lys Arg Gly Asp He Val Phe Val Asp Ser Ala Ala 
175 180 185 190 

TGC TTT GCT ATC CAG AAA GGA CTA CAG GCA TCA CGC AGT GAC ATT AAG 624 
Cys Phe Ala He Gin Lys Gly Leu Gin Ala Ser Arg Ser Asp He Lys 
195 200 205 

TTA TTC AAG CAC AAT GAC GTA GCT GAC CTG GAG CGA CTG CTA AAA GAA 672 
Leu Phe Lys His Asn Asp Val Ala Asp Leu Glu Arg Leu Leu Lys Glu 
210 215 220 

CAA GAG ATT GAA GAT CAA AAG AAT CCT CGA AAG GCC CGT GTG ACT CGG 720 
Gin Glu He Glu Asp Gin Lys Asn Pro Arg Lys Ala Arg Val Thr Arg 
225 230 235 

CGA TTC ATC GTC GCG GAA GGA TTG TAT ATG AAC ACT GGA ACC ATC TGC 768 
Arg Phe He Val Ala Glu Gly Leu Tyr Met Asn Thr Gly Thr He Cys 
240 245 250 

CCT CTT CCA GAA CTG GTG AGG TTA AAG TAT AAA TAC AAA GCA AGG ATC 816 
Pro Leu Pro Glu Leu Val Arg Leu Lys Tyr Lys Tyr Lys Ala Arg He 
255 260 265 270 

TTC CTG GAG GAG AGC CTG TCG TTC GGA GTC CTT GGG GAG CAT GGG CGA 864 
Phe Leu Glu Glu Ser Leu Ser Phe Gly Val Leu Gly Glu His Gly Arg 
275 280 285 

GGA GTC ACC GAG CAC TAT GGG ATC AGT ATT GAC GAC ATC GAC CTT ATC 912 
Gly Val Thr Glu His Tyr Gly He Ser He Asp Asp He Asp Leu He 
290 295 300 

AGT GCT AAC ATG GAG AAT GCG CTC GCT TCT GTT GGG GGC TTC TGC TGT 960 
Ser Ala Asn Met Glu Asn Ala Leu Ala Ser Val Gly Gly Phe Cys Cvs 
305 310 315 

GGC CGC TCT TTC GTG GTT GAC CAT CAG CGG CTC TCT GGC CAA GGA TAC 1008 
Gly Arg Ser Phe Val Val Asp His Gin Arg Leu Ser Gly Gin Glv Tvr 
320 325 330 

TGC TTT TCA GCT TCA CTG CCC CCC CTG CTA GCC GCT GCC GCC ATT GAG 1056 
Cys Phe Ser Ala Ser Leu Pro Pro Leu Leu Ala Ala Ala Ala He Glu 
335 3 40 345 350 

GCC CTC AAC ATC ATG GAG GAG AAC CCA GGG ATT TTT GCA GTT TTA AAG 1104 
Ala Leu Asn He Met Glu Glu Asn Pro Gly He Phe Ala Val Leu Lys 
3 55 360 365 
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AAA AAG TGC CAG ACC ATC CAC AAG TCC CTA CAA GGG GTT TCC GGT TTA 1152 
Lys Lys Cys Gin Thr He His Lys Ser Leu Gin Gly Val Ser Gly Leu 
370 375 380 

AAA GTG GTG GGG GAG TCC CTT TGC CCA GCG CTT CAC CTC CAG CTG GAA 1200 
Lys Val Val Gly Glu Ser Leu Cys Pro Ala Leu His Leu Gin Leu Glu 
385 390 395 

GAG AGC ACG GGC TCT CGG GAG AGA GAT ATG AAG CTG CTT CAG GAG ATT 1248 
Glu Ser Thr Gly Ser Arg Glu Arg Asp Met Lys Leu Leu Gin Glu He 
400 405 410 

GTA GAG CAA TGC ATG AAT AAG GGC ATC GCA TTG ACT CAG GCA CGC TAC 1296 
Val Glu Gin Cys Met Asn Lys Gly He Ala Leu Thr Gin Ala Arg Tyr 
415 420 425 430 

TTG GAC AAG GAA GAG AAG TGC CTT CCT CCT CCA AGC ATC AGG GTT GTG 1344 
Leu Asp Lys Glu Glu Lys Cys Leu Pro Pro Pro Ser He Arg Val Val 
435 440 445 

GTC ACC GTG GAG CAG ACA GAC GAA GAG CTG CAG AGG GCT GCA GCC ACC 1392 
Val Thr Val Glu Gin Thr Asp Glu Glu Leu Gin Arg Ala Ala Ala Thr 
450 455 460 

ATC AGG GAG GCG GCC CAG GCT GTG CTT TTG TAGGCTCCTG CCCAGTGCTT 1442 
He Arg Glu Ala Ala Gin Ala Val Leu Leu 
465 470 



GCGACCATGT CACCTGCAGA 


CAGAACTACT 


CAGACCTCCA GCTGCCCAGG 


GCGTGGAGTG 


1502 


CTGCCCAGGG CGTGGCTGCC 


CTTGAGTCCA 


CAGCCAGAAT 


GGATGGCTTT 


ACCCAGCAGG 


1562 


CTTCCAGAGG ATGGCAGACG 


ACGGTGTAAC 


ACTGGTGTTT 


ATGCGGTGCC 


ATGGCTCTGT 


1622 


CTCTTATACT CGCCTTTGTT 


TAAAACTGGA 


GCTTCCGTTT 


TCCCTCTCCC 


GCCTGTGGGA 


1682 


ACAATTCGAT AAGCACTCCT 


CTGTATTCCA 


TACAAAAGGA 


CTTGCAGCTG 


AGAGGAAGTC 


1742 


TCCACGCACC ACAGTTACTG 


TGGGATGATA 


GACTGTGGCC 


TCCCAGGTCC 


TTCGGGTAGA 


1802 


GCTCAGTGGT GGCTGCTGTG 


CAGACAACTC 


CTGTGTGTCA AAACTGTCAT 


AGAACATGTT 


1862 


GATCCTAGAA CTCTGTGTGC 


TCCAGTTACT 


GATGCTGTCG 


TGTCTAATCT 


GAGCTTCACT 


1922 


AACAATAGTC ATACCGCTCC 


CTAAGCAATA 


ATCTGAATCA 


AACATTATTT 


TATTTTTGTG 


1982 


TAATTGACTT TATATTTTTT 


ATATATTCTC 


TAGTCAGTTT 


TGGGAAGAGG 


GTTATTATTC 


2042 


TGGTCCCCAA ATATATAAAT 


CATATCCTTG 


GTATTTTTTA ATATGTACGG 


TGTCCTTTTT 


2102 


CATGGTGGTT TTGTCCAGGA 


ACATAGAACT 


GTGGGCCTTT 


TCTAAGCTGA 


GAAAAGAGAG 


2162 


AACTTTCTTC TTTCGTTCTA 


TAAATTCCAG 


ATGCCTCAGA 


AAATAGGAAT 


GCTCTGAAAA 


2222 


CTGCTGCTGT CCTTAGCGAG 


CGTGCTGAGG 


GGTGTAACGG 


CTTGTGACAT 


GTGCCGCGAT 


2282 


CGTGTACATG GAAGGCCATT 


ACTCCCTCTC 


GCTGCACAGG 


CAGAGGAATG 


GGGCCTCTAA 


2342 


CCGTCGTTTT CGGATGTTCA 


TTCATGTTAA 


CAGTGGACGG 


CATCTTCCTA 


AGTGATCTGT 


2402 


GACATGTTTA ATGTGTCTAT 


TTCATACGTG 


CCTTGTGAAT 


GCTGCTGCTG 


TGAGGGTACG 


2462 
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GGGATGAGTC TGTTTTGGTT TGGTTTTTAA TGAAATAAAC CTCCAAGAGC CTACAAAAA 2521 

(2) INFORMATION FOR SEQ ID NO: 58: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 472 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 58: 

Ala Thr Val Ala Glu Gin Trp Val Leu Val Glu Met Val Gin Ala Leu 
15 10 15 

Tyr Glu Ala Pro Ala Tyr His Leu lie Leu Glu Gly lie Leu He Leu 
20 25 30 

Trp He He Arg Leu Val Phe Ser Lys Thr Tyr Lys Leu Gin Glu Arg 
35 40 45 

Ser Asp Leu Thr Ala Lys Glu Lys Glu Glu Leu He Glu Glu Trp Gin 
50 55 60 

Pro Glu Pro Leu Val Pro Pro Val Ser Arg Asn His Pro Ala Leu Asn 
65 70 75 80 

Tyr Asn He Val Ser Gly Pro Pro Thr His Asn He Val Val Asn Gly 
85 90 95 

Lys Glu Cys Val Asn Phe Ala Ser Phe Asn Phe Leu Gly Leu Leu Ala 
100 105 HO 

Asn Pro Arg Val Lys Ala Ala Ala Phe Ala Ser Leu Lys Lys Tyr Gly 
115 120 125 

Val Gly Thr Cys Gly Pro Arg Gly Phe Tyr Gly Thr Phe Asp Val His 
130 135 140 

Leu Asp Leu Glu Glu Arg Leu Ala Lys Phe Met Lys Thr Glu Glu Ala 
145 150 155 160 

He He Tyr Ser Tyr Gly Phe Ser Thr He Ala Ser Ala He Pro Ala 
165 170 175 

Tyr Ser Lys Arg Gly Asp He Val Phe Val Asp Ser Ala Ala Cys Phe 
180 185 190 

Ala He Gin Lys Gly Leu Gin Ala Ser Arg Ser Asp He Lys Leu Phe 
195 200 205 

Lys His Asn Asp Val Ala Asp Leu Glu Arg Leu Leu Lys Glu Gin Glu 
210 215 220 

He Glu Asp Gin Lys Asn Pro Arg Lys Ala Arg Val Thr Arg Arg Phe 
225 230 235 240 

He Val Ala Glu Gly Leu Tyr Met Asn Thr Gly Thr He Cys Pro Leu 
245 250 255 

Pro Glu Leu Val Arg Leu Lys Tyr Lys Tyr Lys Ala Arg He Phe Leu 
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260 265 270 

Glu Glu Ser Leu Ser Phe Gly Val Leu Gly Glu His Gly Arg Gly Val 
275 280 285 

Thr Glu His Tyr Gly He Ser He Asp Asp He Asp Leu He Ser Ala 
290 295 300 

Asn Met Glu Asn Ala Leu Ala Ser Val Gly Gly Phe Cys Cys Gly Arg 
305 310 315 320 

Ser Phe Val Val Asp His Gin Arg Leu Ser Gly Gin Gly Tyr Cys Phe 
325 330 335 

Ser Ala Ser Leu Pro Pro Leu Leu Ala Ala Ala Ala He Glu Ala Leu 
340 345 350 

Asn He Met Glu Glu Asn Pro Gly He Phe Ala Val Leu Lys Lys Lys 
355 360 365 

Cys Gin Thr He His Lys Ser Leu Gin Gly Val Ser Gly Leu Lys Val 
370 375 380 

Val Gly Glu Ser Leu Cys Pro Ala Leu His Leu Gin Leu Glu Glu Ser 
385 390 395 400 

Thr Gly Ser Arg Glu Arg Asp Met Lys Leu Leu Gin Glu He Val Glu 
405 410 415 

Gin Cys Met Asn Lys Gly He Ala Leu Thr Gin Ala Arg Tyr Leu Asp 
420 425 430 

Lys Glu Glu Lys Cys Leu Pro Pro Pro Ser He Arg Val Val Val Thr 
435 440 445 

Val Glu Gin Thr Asp Glu Glu Leu Gin Arg Ala Ala Ala Thr He Arg 
450 455 460 

Glu Ala Ala Gin Ala Val Leu Leu 
465 470 

(2) INFORMATION FOR SEQ ID NO: 59: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 598 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 



(vii) IMMEDIATE SOURCE: 
(B) CLONE: HW062 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION: complement (87.. 223) 

(D) OTHER INFORMATION: /label= SAC_24443 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 59: 

TTTTTGTGAG CAAATCCAAA TTTATTTTAA TGCCATGTCA TTTCAAAAAC CCCAGCCTTG 60 

GTTCCCTGGA ACAGCATGCC AGAGGTACAA AGTGTAACCT TCCTCTATAA ACCCCCAGCA 120 

AGTCATCCAA GTCCTCAGCT TCAGAAAGTC AGTTCTCTTC AGTCTACTCC TCTGGTTCCT 180 

GGTTTTTCTT TCAAGGAAGG CAACTTAAAA ATTAGGTAGC TGAGGTTCCA ATGTTGGTGG 240 

CTAACCTTGC CTCCTCTTTT GTTCTGTAGT AGGGCCACAA ACCTTGACCA GGCAGAGTAG 300 

TAGAAAAGTA GAAAGAGGGG CTTGACGACG GTGGATTTTG ACTCCTGATT TTATTATTCA 360 

ATTTCTTTTT CTACTAAAAG TAGTCTTCGG TGGTTGGGAA GCCTGGCCTC CCAACACCAG 420 

AGTCAGTCGG AGCTGGTTTT TTTGTTGAAA GGAGTGGGCG GGTGGGTGGG GGACCGGGAT 480 

GAGGGCAGAA CCCCGCTCTG CTGGTAGTCT TGGGTGGAGA AGACGAACTG CACTTGACAG 540 

AGCCTGGGGG TGCGGTGGGA GGGGGTGAGG CAGGAGTGAC AGCTGGGGAG GGGACCCA 598 
(2) INFORMATION FOR SEQ ID NO: 60: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3305 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iv) ANTI -SENSE: NO 

(vii) IMMEDIATE SOURCE: 
(B) CLONE: HW066 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION: 759.. 906 

(D) OTHER INFORMATION: /label* SAC_24521 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 60: 

ATTCGCGGCC GGCTCGACTT TTTTTTTTGT GATATAAAAA ATTTATTTTA GAAATCACAT 60 

CTTTAAAAAA CAACATAGAA GGTTATCCTG ACCAGATATA AATTCTTCAT TCAATGCCAG 120 

GGTGTTATGT GGTACAATTA GCGGGATGTT CTGTGAGGAG AATGAGTCTG TGTGATACTC 180 

TCGACATGTA CACCAGCTCA AACGTTAATG CGGCTTTTAA AAGGAAGGAG AGAGGAAGGC 240 

AGCGAGAGAG GAGGAAACTG TTTTGGAAAC AGAAGAGCAT CTCCGTCCTG CACCTGTCAG 300 

CCCTGAAACT CCCGGCCCTT GAGACCAGCC CAGCAGAGGC TGCACGAANN NGCATCATTT 360 

TAATTCACAA GTGACTGCAG CACCCTGGCA TCTTGTGCTG GGAGTGTGGG TGGGAACAGA 420 
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GGTGACAAGT CTGCTCTGTA GTGTCATAGC CAGAGAGCAA CAGGGACTCT AGAGGAAGAA 480 

GAGCAAAGCT TGCCTAGACA GAACGGGTTG TCCCTGTCCC TGTCCATCAG CTTTCTTATA 540 

ACCTATGGNT AGCCAGGATC AAATACCTGA AAGTCAGAAT GTAAAATAAA AGTGTCAGTT 600 

AACACTTATT GCGGACAGAA ATACAGCTCT TTACGCCTAC AACCATAAAC CCAGTGGTGA 660 

GGCTGGGACA GACCTTTTAT GCCGAAGTTC TCACAACTCG GGACTGACTT ACTGGGGCCT 720 

GGTCACTGCT TAATTGGTGA CCTCACACCA TTTCAGGTAC TTCTGCCCAG CTCCTGATTC 780 

CTCTTCAGGA GAAACAGGAA GCAGACTTTG TTAGTAAGTG GGTCATGCAG GGCTGGGCCG 840 

CCTACATGGG ACTTAACACA GAGAGAATAC TGCCAGCTTC CCAAACACTT CAAGAGAGAA 900 

AAATGTCCAG ACGTGGGTGC TGCAGCAACC TTCCAAGAGT GAGGCCCCTG CACCCTCCTG 960 

CACCCTCGGC TGGGGACAGT GGGCTCTGGT GGCTTGAACT GAGTAGATCG GATCATTCCC 1020 

TCCTCTTTCT GAGACAAGGC TTCACTGTGT AGCCCNNGCT GCCCTGGAGC TCACTATTTA 1080 

GACCAGGCTA GGATCTCACA GAGAACCACC TGCCTCTGGC CTCTCAAGAG CTGGGATTAA 1140 

GGTGTGAAAA GGGGGCTAAT GTAGCCCAGG CTGGCTCCTG CTCCACAGGG CTTGCAGGCC 1200 

AGTTCTGGAT CCTACCAGCA AGTACAAGCT GGTACCCAGA GGACGCTGCA GGATGGAAGG 1260 

ACGGAAGGCC AGGAGCCTTC GTTTGATTTG ACGGGAAGGA GGAGGAACAG TGAGGTTGTG 1320 

CATCTGAGCT TTATTACCAG GACGATGATG TCACCTCAGG AAGGGACTGG GATCCCTGCA 1380 

AGTGGTTCCA CATGGGACCC AGGTCCTCAG AGATGTGGCC CTCCTTGTGT GCCAAAATGC 1440 

ATCTGGAAAG AAAATACAGA AGGGGAGGGA CAGCATAGGA GGAGACAAGT GTGACCCGAG 1500 

GCCCAGGAAG TACTTCCATA TTCTGTCCTT CTCCAGGCAC GAGAGTTGAA GGCCGTGCCA 1560 

AGCCCTGCAC TCGCTAAGCT GGTGAAGACC ACACCAAAAT CACTGTGACA GGACAGCTGC 1620 

ACAGATCTGT TTCCGACTTT TCCCACCACG ATTCTACACC AAGGTAAAAA AGAGCTTCCA 1680 

CCACAAATCC AAGGCAGGAG CTGGGCGGCC ACACACCCGC CACAGATGGT GCAAGTTGCT 1740 

CACTTCCGCA GCCGGTGGCA GGACAGGGAG GAGGGCAGAG TCCTGAGACA AGAACCCCAG 1800 

AGCGGGGCCA GTTCAGTCAG TGGTGGACCT CCGGTACCAG AACCGAGCTC GGAAGTCGTC 1860 

GCTGCACGTC ATGAAGCCAG GGTTGGTCGG GGAGTGCACT ATGCAGCGCA CAATGTTGTT 1920 

GTGGCCCAGG GACAGCAGGT TCCTGCGCTC AGCTGTCCTA GAGTCCCAGC AGCACAGGCT 1980 

GATGGTCCTT TCGTCCGGCA GCAAAATATA GTCCTCAGTG TGGTTGAACA CAGCCTGTGT 2040 

CCGGGTGCAC TTGCCGTCCA CTCAGCCCTG CGCCTGTGTA CCTCACCAGT GTCCGTCCCG 2100 

TGGATATTTC CCCAGAGTTT AGCTACAGAG TCTTTYCCNN GAGTTTAGGC CTACAGAGTC 2160 

TTTTCCACTT GAGAGGATGT ACTTGGAATT CTTGGAGAAA ATGGCAGAAC AGACTTCTGC 2220 

CCCGTCGTGA GCTTTCTCAA AGGTTGTGAT GCAGCGATTG GAGACGCCGT CCCACAGCTT 2280 
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GATGCAGCCG TCCTTGCTGC CAGTGACATA CATGTTGGCA CTAGGGTTAT AGTTGACAGA 2340 

GCAGATGGCG TCGGTGTGCT GGTCTTGGGG ATTGCAGGAG ACGAAACACT GGAAGGTGTT 2400 

GATGTCATAG AGGCGAAGCG TGGGATGCTG GGTCCCGACC AGGATGAAGT CTCCAGAAGG 2460 

GTGGAACGAG ATGGAGCGCA GCATCTCAGC TTCCTGAATG TATTTAAAGG CTCTTTTTGC 2520 

AGAGGGTTTG GAATAATCAA ACAATTTTAG AGTATAATCC CTTGAGCCGG AGGCCAGGAT 2580 

CTGTTCTGTT GGGTGAAAAG CGAGACACGT GACTTCATCC ACATGGTCGT AAAGAGTTCG 2640 

GATCACTGGG TGGTTCTCCA TGTTCTGTTG TGCAGTCTCA TTCATCATGA CCCTCGATGG 2700 

GCATGGCGCT CTTGGCCAGC ATCCTCTCTG TGTCCAGTAT CTTGATGGAA GCATCCGCAG 2760 

ACCCTGTGGC TATCAACTGC CCATCTCTGC TGTAGGTGGC CACTCGGCAT GGGCCTTTGT 2820 

GGGAGGTGAC ATAACAGGTT TCATACTCCG AAGCCTCGGG TGACATTGTT TGGACGTCTG 2880 

CATCAAACTC CAGGTCAATN CTGTGCCAGG GGCAACCGTG TCAGAGCGAC CAATGGCATA 2940 

CTGAACTGCA GTGTCATCGT TCTCCATCCC TAGCTTGATG AGGTGCAGGA GCTGCTCCGA 3000 
GGGCGCACAC ACCGACTGTG GCTTGATCTC GTTGATGAGG CCATTGCGAT GCTAATGTAT 
CCGTCATAGA GCAGCTGGCT GATGATCAGC TTGTAGAGCT GCTGGCGGTC CTTCAGGCCC 

ACTTTGGTTC TGTACATTCT GCACAAGAGG ATGACAGTTC CCCTGACTTC TCCGAGCTGA 3180 

TCACCTCCGC GACAGCCTTG AATCCAAAGG ACTGTGAGTG ACCGGCAAAC GCCCGANCCT 3240 
TGGAGAGCAC NGCGGGCGAT AGAACGGCCG TAGATCGGAC AGGCCGTTGG TGCACCGTCG 
ACCGG 

(2) INFORMATION FOR SEQ ID NO: 61: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2099 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 

(vii) IMMEDIATE SOURCE: 
(B) CLONE: HW069 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1360.. 1893 

(ix) FEATURE: 

(A) NAME /KEY: misc_f eature 

(B) LOCATION: 1744.. 1885 

(D) OTHER INFORMATION: /label= SAC_24477 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 61: 
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GTCGACGCAT CCAGGCAGGC ATGGAGCAGG TAGGGCTCAG AGTTCTACAT CTTCATCTGG 60 

AGGCTACTAT GAGGAGACCG ACTTCCAGGC AGCTAGGAAG CCCATGCTCA CAGTGACAGA 120 

CCTTCTCCAA CAAGGTTATA CCTCCTAATC GTGCCGCTCC CTGGGCCAAG CATATGCAAA 180 

CCTTCACACA GGTCTTCCTC CTACACAGTG GGGCTTCTGG GCCCCACTCA CGGCCAGTAC 240 

ATGGCAAAAG ACATTAAAGA TGCACTTGGA CCAGCTCAGG GCCCTGTTCT GCAGCCAGGT 300 

ATCATGGATG GAGGAAGTCC TGACTGAAAC AAAAATGTCT GTCCGCATTG TCATCTTTGA 360 

GAGCCTGAAC AAACCCCGAA CACCTCTGAC CTGGCACAGG GACCCTGAGT CTGCCTGCCT 420 

GGCACTCTGA ACTTGCTGGC TACTGGGCTG AAGTCTCAGG ATCGAGGTAC AAGGACAAAG 480 

CCTGTTGGTA TTATGTAGAC AGGAGCCCCA GGGGGCTCAG AGCAGGGGTT TACCGTTACT 540 

CTTGGTCCAG CCGGCACAGT GGTCTTCCCT CAGTTTTTGT TTTTGTTTTT TTGTTTTTTT 600 

GTTTTGTTTT TTGTTTTTTG TTTGTTTGTT TGTTTTGTTT TGTTTTTTGG TGCTGTTGAC 660 

CCTGCAGTCT GGACTGCCCC CTTGGGCAAT ACATTTTGAT GGAAACCAAC TTCTGTGTTC 720 

CCAGATCGGC AGCCTCTCCA TGCCGAGCAC CCCTGTTCTA GCTAACCCGG ACATGCTACA 780 

CCCCTGCCGT GGTTGGTTCA TCTCCCCACT CCCCGCCGTG TCTTAATAAC ACGGTGGATG 840 

GATGGACAGA CGGACAGATG AAGCTTTGAC AGCTAGCCAC TGCTCTGCTC CTTTTCTCTT 900 

GTCTTCTGAG TAAATTCTCA AACTCAAGCC TCCCAGGGGG AGCCCAGAAC CCCACCCACC 960 

ACTAGTGAAA AAGCAGCTTT TTAGGCAGCT CAGAGAGTGT CCGGAGCAGG GTTCCTACAG 1020 

GCTCAGGTTG GGGGCAGTTT CAGTGTGTGG CTGCCGGCCC AGGGCCCTCC CTGGTCAGCA 1080 

CCTAGGTAGA GAGGAAGCAG GAAGGACCAA GTCCAGCTCA AAGGAACCAG TGAGGCTGGT 1140 

TGAGAGCAGA CGGTGGGTAT GGGTCTGTGA AAAGAAAAGT AGGCCGTTGG GAGCCCACCG 1200 

GAGGACAGTC TGGCTCAAGG CATCACACCT GGTTGCAGCA GCCTCCGAGA CAGCAGCCCG 1260 

TGGGTGTGAA CTTGGCTCCA TCTTGGCTGA ATTCTGCTGC CCCAGAGAAG AATCTGGCAC 1320 

CGGAGTGGAC CCAGATACGT GTCACAGCCT TGGAAATCG ATG GCA GCG AGA CCC 1374 

Met Ala Ala Arg Pro 
1 5 

TTT ATT GGG CAC CTA GAG ATT TCC GAG AGG CGG TTC CTA ACA TTG TCA 1422 
Phe lie Gly His Leu Glu He Ser Glu Arg Arg Phe Leu Thr Leu Ser 
10 15 2 o 

TTC CTT TCC CAC AGG CCT CAG GTC CAG GCC CAA AGC CAA GTG ACT GCC 1470 
Phe Leu Ser His Arg Pro Gin Val Gin Ala Gin Ser Gin Val Thr Ala 
25 30 35 

CTC ATC GGC TCA GCA CAT CGG CGG GTG CTG GAT GTG CCG CCT GCA GAG 1518 
Leu He Gly Ser Ala His Arg Arg Val Leu Asp Val Pro Pro Ala Glu 
« 45 50 

GCT CTT CGG GGC CAG GCT GGG ACT GGG CCC CTG AGA AAA CCC AAA CTC 1566 
Ala Leu Arg Gly Gin Ala Gly Thr Gly Pro Leu Arg Lys Pro Lys Leu 
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55 60 65 

TGT GCC TTA CCC AGT CAG GAT CCT TGG GAG CCT TGG CCC TCG ATG CCA 1614 
Cys Ala Leu Pro Ser Gin Asp Pro Trp Glu Pro Trp Pro Ser Met Pro 
70 75 80 85 

GGA CCT GGC CTA GCA TCA CCC TGG GCC ACC TGC ACG TTC ATG GCC TCA 1662 
Gly Pro Gly Leu Ala Ser Pro Trp Ala Thr Cys Thr Phe Met Ala Ser 
90 95 ioo 

CCA TCT GCT TTG CGA ACC ACT ACC CAC CCC CTG TCG CAT GCA CCT TCC 1710 
Pro Ser Ala Leu Arg Thr Thr Thr His Pro Leu Ser His Ala Pro Ser 
1 05 no us 

ACC CCT GCG GTG TCT GAC TCT AGG GAG ATA GGT ACA GAT GTC TGT GAT 1758 
Thr Pro Ala Val Ser Asp Ser Arg Glu lie Gly Thr Asp Val Cys Asp 

GGG CTG AGA CAT CCC CCA CCC CCC ACG GCT CAT TGG TTT CTT TTA GCT 1806 
Gly Leu Arg His Pro Pro Pro Pro Thr Ala His Trp Phe Leu Leu Ala 
135 140 145 

GAG TGC TGT TTT CTG TGT CTT TAC TCT GCT ATG ATG ACT TGG GGG GTT 1854 
Glu Cys Cys Phe Leu Cys Leu Tyr Ser Ala Met Met Thr Trp Gly Val 
150 15 5 160 165 

AGT GAG GGC CTG AGG GAT GCT TGT TGG AAG TAC TGT GGG TAGGGCCTGG 1903 
Ser Glu Gly Leu Arg Asp Ala Cys Trp Lys Tyr Cys Gly 
170 175 

TAGGTCCTTA GGGAACAAAC AAACAAAAGG CTCATCTGCT CCTGGGTGGT GACCTGGCCA 1963 

CAGCACCTGT CATCTCCCTT GGTGTCTGCC CAGTGGGGAC GAGCTTCCCT GGCTCAACCC 2023 

CTATCAGGTG TGCACTGTTG AGTCTGCGGG CCAGCCGGGA CTTTAACCAA TAAAGAGCAA 2083 
CCTTGTCTGT CAAAAA 



(2) INFORMATION FOR SEQ ID NO: 62: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 178 amino acids 
(8) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 62: 

Met Ala Ala Arg Pro Phe He Gly His Leu Glu He Ser Glu Arg Arg 
1 5 10 15 

Phe Leu Thr Leu Ser Phe Leu Ser His Arg Pro Gin Val Gin Ala Gin 
20 25 30 

Ser Gin Val Thr Ala Leu He Gly Ser Ala His Arg Arg Val Leu Asp 
35 40 45 

Val Pro Pro Ala Glu Ala Leu Arg Gly Gin Ala Gly Thr Gly Pro Leu 
50 55 60 

Arg Lys Pro Lys Leu Cys Ala Leu Pro Ser Gin Asp Pro Trp Glu Pro 



2099 
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65 



70 



75 



80 



Trp Pro Ser Met 



Pro Gly Pro Gly Leu 
85 



Ala Ser Pro Trp Ala Thr Cys 
90 95 



Thr Phe Met Ala Ser Pro Ser Ala Leu Arg Thr Thr Thr His Pro Leu 
100 105 no 

Ser His Ala Pro Ser Thr Pro Ala Val Ser Asp Ser Arg Glu He Gly 
H5 120 125 

Thr Asp Val Cys Asp Gly Leu Arg His Pro Pro Pro Pro Thr Ala His 
130 135 140 

Trp Phe Leu Leu Ala Glu Cys Cys Phe Leu Cys Leu Tyr Ser Ala Met 
145 150 155 J 160 

Met Thr Trp Gly Val Ser Glu Gly Leu Arg Asp Ala Cys Trp Lys Tyr 
165 170 175 

Cys Gly 



(2) INFORMATION FOR SEQ ID NO: 63: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1210 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 



(vii) IMMEDIATE SOURCE: 
(B) CLONE: HW070 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION: complement (639.. 928) 

(D) OTHER INFORMATION: /label= SAC_24286 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 63: 
GTCGACGCAG ATTTTGATGG CGTCCTTTAT CATATTTCAA ATCCTAATGG AGACAAAACA 
AAAGTGATGG TCAGTATTTC TTTGAAATTC TACAAGGAAC TTCAGGCACA CGGAGCTGAT 
GAGTTACTAA AGAGAGTATA CGGAAGTTTC TTGGTGAACC CAGAATCAGG ATACAATGTG 
TCTTTGCTAT ATGACCTGGA AAATCTACCT GCATCCAAGG ATTCTATTGT GCATCAGGCT 
GGCATGTTGA AGCGAAATTG TTTTGCCTCT GTGTTTGAGA AATACTTCCA ATTCCAAGAA 
GAGGGCAAAG AAGGAGAGAA CAGAGCAGTT ATCCATTATA GAGATGATGA GACCATGTAT 
GTGGAATCTA AAAAAGACAG AGTCACAGTA GTCTTCAGCA CAGTTTTTAA GGATGACGAC 



60 
120 
180 
240 
300 
360 
420 
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GATGTGGTCA TTGGAAAGGT GTTCATGCAG GAGTTCAAAG AAGGACGAAG AGCCAGCCAC 480 

ACAGCCCCCC AGGTCCTTTT TAGCCACAGG GAACCTCCTC TAGAACTGAA AGACACAGAT 540 

GCTGCCGTGG GTGACAACAT TGGCTACATC ACCTTTGTGC TGTTCCCTCG CCACACCAAT 600 

GCCACTGCTC GAGACAACAC CATCAACCTG ATCCACACGT TCCGGGACTA TCTGCACTAC 660 

CACATTAAGT GCTCTAAGGC CTATATTCAT ACACGAATGC GAGCAAAAAC ATCCGACTTC 720 

CTTAAGGTGC TCAACCGTGC ACGCCCAGAT GCCGAGAAAA AAGAAATGAA AACAATCACG 780 

GGGAAGACTT TTTCATCCCG CTAACTCTTG GGAACTGAGG AGGAAGCACT GTTGACCGAA 840 

GGCTGGACCG CTTGCTACTG GATAATCGTA GCCCTTCATG TTGCACCTTC TCCAGGTTCT 900 

TAAGGGATTC TCCGTTTGGG TTCCATTTTG TACGAGTTTG GCAAATACGC TGCAGAAACG 960 

AGCTGTGCTT GCAAGGACTT GATAGTTCCT AAGAATTAAA AACTCACTTG ATCAACTTAA 1020 

TTCCCTTTTA TTTTTCCTCC CTCACTCCCC TTCCTTTTCC AAGCTGTTTG CTTTGCAATA 1080 

TGTTACTGGT AATGAGTTGC AGGTAATGCA ATCTTAACTT GTTTTCTTCT AAGTATTTGA 1140 

GTTCAAAACT CCTGTATCTA AAGAAATACG GTTGGGGTCA TTAATAAAGA AAATCTTTCT 1200 

ATCTTAAAAA 121Q 
(2) INFORMATION FOR SEQ ID NO: 64: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1844 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 

(vii) IMMEDIATE SOURCE: 
(B) CLONE: HW073 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 9.. 1223 

(ix) FEATURE: 

(A) NAME/KEY: misc_f eature 

(B) LOCATION: 598.. 1036 

(D) OTHER INFORMATION: /label= SAC_24456 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 64: 

GTCGACCT GCC GGC CGA CAG GGC CGC GGG ATG AGG AAG CGG ACC GAG CCC 50 
Ala Gly Arg Gin Gly Arg Gly Met Arg Lys Arg Thr Glu Pro 
1 5 io 

GTC ACC TTG GAG CAT GAG CGC TGC GCC GCC TCA GGC TCG TCT TCC TCC 98 
Val Thr Leu Glu His Glu Arg Cys Ala Ala Ser Gly Ser Ser Ser Ser 
15 20 25 30 
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GGC TCG GCC GCC GCG GCG CTG GAC GCC GAC TGC AGC TTG AAG CAG AAC 146 
Gly Ser Ala Ala Ala Ala Leu Asp Ala Asp Cys Ser Leu Lys Gin Asn 
35 40 45 

CTG CGT CTG GCG GGC AAG GGG ACG GCA GAG CCG CAC AGC GCA TCC GAC 194 
Leu Arg Leu Ala Gly Lys Gly Thr Ala Glu Pro His Ser Ala Ser Asd 
50 55 60 

GCG GGC ATG AAG CGG GCT CTG GGC AGA CGG AAG AGC CTG TGG TTC CGA 242 
Ala Gly Met Lys Arg Ala Leu Gly Arg Arg Lys Ser Leu Trp Phe Arg 
65 70 75 

CTA AGG AAG ATA CTT CTC TGT GTT TTG GGG TTC TAC ATT GCC ATT CCA 290 
Leu Arg Lys He Leu Leu Cys Val Leu Gly Phe Tyr He Ala He Pro 
80 85 90 

TTT CTT GTC AAA CTG TGT CCT GGG ATA CAG GCC AAA CTG ATA TTC TTA 338 
Phe Leu Val Lys Leu Cys Pro Gly He Gin Ala Lys Leu He Phe Leu 
95 100 105 no 

AAT TTC GTG AGG GTT CCC TAT TTC ATT GAC TTA AAA AAG CCA CAG GAT 386 
Asn Phe Val Arg Val Pro Tyr Phe He Asp Leu Lys Lys Pro Gin Asd 
U5 120 125 

CAA GGT TTG AAT CAC ACC TGC AAT TAC TAC CTC CAG CCC GAG GAT GAT 434 
Gin Gly Leu Asn His Thr Cys Asn Tyr Tyr Leu Gin Pro Glu Asp Asd 
130 135 140 

GTC ACT ATT GGA GTC TGG CAC ACC ATT CCC TCT GTC TGG TGG AAG AAT 482 
Val Thr He Gly Val Trp His Thr He Pro Ser Val Trp Trp Lys Asn 
145 150 155 

GCC CAA GGG AAG GAC CAG ATG TGG TAT GAG GAT GCT CTG GCT TCT AAC 530 
Ala Gin Gly Lys Asp Gin Met Trp Tyr Glu Asp Ala Leu Ala Ser Asn 
160 165 170 

CAC CCC ATC ATC CTG TAC CTG CAT GGG AAT GCA GGC ACC AGA GGA GGT 578 
His Pro He He Leu Tyr Leu His Gly Asn Ala Gly Thr Arg Gly Glv 
175 180 185 190 

GAC CAC CGT GTG GAG CTG TAC AAG GTG CTG AGT TCC CTT GGT TAC CAC 626 
Asp His Arg Val Glu Leu Tyr Lys Val Leu Ser Ser Leu Gly Tyr His 
1^5 200 205 

t?? ££ C T ^ GAC TAC AGA °GT TGG GGT GAC TCA GTA GGA ACA CCA 674 

Val Val Thr Phe Asp Tyr Arg Gly Trp Gly Asp Ser Val Gly Thr Pro 
210 215 220 

TCA GAG CGA GGC ATG ACA TAT GAT GCA CTC CAT GTT TTT GAC TGG ATC 722 
Ser Glu Arg Gly Met Thr Tyr Asp Ala Leu His Val Phe Asp Trp He 
225 230 235 

AAA GCA AGA AGT GGT GAT AAT CCT GTG TAT ATC TGG GGC CAT TCG CTG 770 
Lys Ala Arg Ser Gly Asp Asn Pro Val Tyr He Trp Gly His Ser Leu 
240 245 250 

59° A f T 5? A GTC GCA ACA AAT CTG GTC CGG CGC CTT TGT GAG CGA GAG 818 
Gly Thr Gly Val Ala Thr Asn Leu Val Arg Arg Leu Cys Glu Arg Glu 
255 260 265 270 

ACG CCA CCA GAT GCC CTT ATA TTG GAG TCT CCG TTC ACA AAT ATT CGT 866 
Thr Pro Pro Asp Ala Leu He Leu Glu Ser Pro Phe Thr Asn lie Arg 
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275 280 285 

GAA GAA GCA AAG AGT CAT CCA TTT TCA GTG ATA TAC AGA TAC TTC CCA 914 
Glu Glu Ala Lys Ser His Pro Phe Ser Val lie Tyr Arg Tyr Phe Pro 
290 295 300 

GGC TTT GAC TGG TTC TTC CTC GAC CCC ATT ACA AGC AGT GGA ATT AAA 962 
Gly Phe Asp Trp Phe Phe Leu Asp Pro lie Thr Ser Ser Gly He Lys 
305 310 315 

TTT GCA AAT GAC GAA AAC ATG AAG CAC ATC TCC TGT CCC CTG CTC ATC 1010 
Phe Ala Asn Asp Glu Asn Met Lys His He Ser Cys Pro Leu Leu He 
320 325 330 

TTG CAC GCT GAG GAT GAC CCA GTT GTA CCC TTT CAT CTC GGC AGG AAG 1058 
Leu His Ala Glu Asp Asp Pro Val Val Pro Phe His Leu Gly Arg Lys 
335 340 345 350 

CTA TAC AAT ATT GCT GCG CCA TCC CGA AGT TTC CGA GAC TTC AAA GTC 1106 
Leu Tyr Asn He Ala Ala Pro Ser Arg Ser Phe Arg Asp Phe Lys Val 
355 360 365 

CAG TTC ATC CCC TTT CAC TCA GAC CTT GGC TAC AGA CAT AAG TAC ATC 1154 
Gin Phe He Pro Phe His Ser Asp Leu Gly Tyr Arg His Lys Tyr He 
370 375 ~ 380 

TAC AAG AGC CCA GAG CTT CCA CGG ATA CTG AGG GAA TTC CTA GGG AAG 1202 
Tyr Lys Ser Pro Glu Leu Pro Arg He Leu Arg Glu Phe Leu Gly Lys 
385 390 395 

TCG GAA CCA GAG CGC CAG CAC TGAGCCTGGC CCGTGAAGGA GCATGGAGAC 1253 
Ser Glu Pro Glu Arg Gin His 
400 405 



CCACCTTCCT 


TCCCTTCTCC 


CTGAACAGCA 


GTCTGGCACC 


CAGAAGCTCA 


GAGTGCCACC 


1313 


ACCTGTGGTG 


CTCAGGAGCC 


CAGCCTAGAA 


AGAGGACTCC 


GACACAGCGG 


GCAGAGGCTC 


1373 


CACAGACGGA 


TCTATGAGGA 


AAATACGGTG 


GCAGGCAGGC 


AGGCAGGCGA 


CCCCCTGACC 


1433 


CTCTGGTGGC 


CGCTGTATCT 


GAGCCCTTTT 


GGGAAGGCTT ATAGACAACA 


GGTGGAGCCC 


1493 


ATACGCTGGG 


CATAGGGAGC 


CTGGGAAGGG 


CTCAGGAGCT 


CAGGACCACT 


CCAGGCTCTC 


1553 


TAGCACCACC 


GCTTAAAATA 


CAGGAAAAAG 


GTTCTTTCTG 


CCCTTCCTGG 


CGTACACAGA 


1613 


ACAGATTCCA 


AGTGGTTCAA 


TTTGTCCCCT 


ACAGCTCATG 


TACCTGCTTG 


CCTTCCTCAG 


1673 


CTGTCCCTGC 


CTCTCCTGGC 


ATCTGTACAC 


CCACAGTGAG 


GGGCACCTGG 


ACTTGCACTT 


1733 


CCATTCTGCC 


CACCTGTCTG 


TCACCTAACC 


TGGCCGTAGA 


CTGAGCATTT 


ATTTAAGAAT 


1793 


AAAATCTCGG 


TGGTGGTCTA 


TTTGTTTTTT 


TCTACAAAAA AAAAAGTCGA 


C 


1844 



(2) INFORMATION FOR SEQ ID NO: 65: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 405 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: protein 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 65: 

Ala Gly Arg Gin Gly Arg Gly Met Arg Lys Arg Thr Glu Pro Val Thr 
15 10 15 

Leu Glu His Glu Arg Cys Ala Ala Ser Gly Ser Ser Ser Ser Gly Ser 
20 25 30 

Ala Ala Ala Ala Leu Asp Ala Asp Cys Ser Leu Lys Gin Asn Leu Arg 
35 40 45 

Leu Ala Gly Lys Gly Thr Ala Glu Pro His Ser Ala Ser Asp Ala Gly 
50 55 60 

Met Lys Arg Ala Leu Gly Arg Arg Lys Ser Leu Trp Phe Arg Leu Arg 
65 70 75 80 

Lys He Leu Leu Cys Val Leu Gly Phe Tyr He Ala He Pro Phe Leu 
85 90 95 

Val Lys Leu Cys Pro Gly He Gin Ala Lys Leu He Phe Leu Asn Phe 
100 105 no 

Val Arg Val Pro Tyr Phe He Asp Leu Lys Lys Pro Gin Asp Gin Glv 
H5 120 125 

Leu Asn His Thr Cys Asn Tyr Tyr Leu Gin Pro Glu Asp Asp Val Thr 
130 135 140 

He Gly Val Trp His Thr He Pro Ser Val Trp Trp Lys Asn Ala Gin 
145 150 155 160 

Gly Lys Asp Gin Met Trp Tyr Glu Asp Ala Leu Ala Ser Asn His Pro 
165 170 175 

He He Leu Tyr Leu His Gly Asn Ala Gly Thr Arg Gly Gly Asp His 
180 185 190 

Arg Val Glu Leu Tyr Lys Val Leu Ser Ser Leu Gly Tyr His Val Val 
195 200 205 

Thr Phe Asp Tyr Arg Gly Trp Gly Asp Ser Val Gly Thr Pro Ser Glu 
210 215 220 

Arg Gly Met Thr Tyr Asp Ala Leu His Val Phe Asp Trp He Lys Ala 
225 230 235 240 

Arg Ser Gly Asp Asn Pro Val Tyr He Trp Gly His Ser Leu Gly Thr 
245 250 255 

Gly Val Ala Thr Asn Leu Val Arg Arg Leu Cys Glu Arg Glu Thr Pro 
260 265 " 270 

Pro Asp Ala Leu He Leu Glu Ser Pro Phe Thr Asn He Arg Glu Glu 
275 280 285 

Ala Lys Ser His Pro Phe Ser Val He Tyr Arg Tyr Phe Pro Gly Phe 
290 295 300 

Asp Trp Phe Phe Leu Asp Pro He Thr Ser Ser Gly He Lys Phe Ala 
305 310 315 320 
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Asn Asp Glu Asn Met Lys His 
325 

Ala Glu Asp Asp Pro Val Val 
340 

Asn He Ala Ala Pro Ser Arg 
355 

He Pro Phe His Ser Asp Leu 
370 375 

Ser Pro Glu Leu Pro Arg He 
385 390 

Pro Glu Arg Gin His 
405 

(2) INFORMATION FOR SEQ ID NO: 66: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 855 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 



(vii) IMMEDIATE SOURCE: 
(B) CLONE: HW074 

(ix) FEATURE: 

(A) NAME/ KEY: CDS 

(B) LOCATION: 7.. 300 

(ix) FEATURE: 

(A) NAME/KEY: misc_f eature 

(B) LOCATION: complement (64.. 365) 

(D) OTHER INFORMATION: /label= SAC_24464 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 66: 

GTCGAC AGC CCT CGA GGG AGC AGA CGC CGC GGG CCA CTG GCT TCC TCT 48 
Ser Pro Arg Gly Ser Arg Arg Arg Gly Pro Leu Ala Ser Ser 
15 10 

ACT CAG GAA CCA CGG ATG TCA CCG CCC AGT CAG CTG TGT CTC CTC ACC 96 
Thr Gin Glu Pro Arg Met Ser Pro Pro Ser Gin Leu Cys Leu Leu Thr 
15 20 25 30 

ATT GTC GCC CTG ATT CTG CCT AGT GAA GGG CAG AC A CCA GAA AAA CCC 144 
He Val Ala Leu He Leu Pro Ser Glu Gly Gin Thr Pro Glu Lys Pro 
35 40 45 

AGA TCC AGT TTT ACA GCG CAC CAG AGT TCT GTG ACT ACT CAT GTC CCA 192 
Arg Ser Ser Phe Thr Ala His Gin Ser Ser Val Thr Thr His Val Pro 
50 55 60 

GTT CCA GAT CAA ACT AGC CCA GGA GTC CAG ACC ACT CCT CCC ATC TGG 240 
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He Ser Cys Pro Leu Leu He Leu His 
330 335 

Pro Phe His Leu Gly Arg Lys Leu Tyr 
345 350 

Ser Phe Arg Asp Phe Lys Val Gin Phe 
360 365 

Gly Tyr Arg His Lys Tyr He Tyr Lys 
380 

Leu Arg Glu Phe Leu Gly Lys Ser Glu 
395 400 
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Val Pro Asp Gin Thr Ser Pro Gly Val Gin Thr Thr Pro Pro He Trp 
65 70 75 

ACC AGT GAA GCT GCG AAG CCA CAG GAA GCC AGA CAG CAG CCA AAA CCA 288 
Thr Ser Glu Ala Ala Lys Pro Gin Glu Ala Arg Gin Gin Pro Lys Pro 
80 85 90 

AGA CCC AGC AAC TGACCGAAAT GGCCACTGCG AATCCAGTGA CAGATCCAGG 340 
Arg Pro Ser Asn 
95 

GCCACTTACA AGCAGCGAGA AAGGTACCCC GGCACTCTCC AGGATCAAAT CTCCCAGCCC 400 

ACCCAAAGGT TACATGCCTC CATCCTACAT TGAGAATCCA CTGGATCCCA ATGAGAACAG 460 

CCCCTTCTAC TACGACAATA CCACCCTCCG GAAACGGGGG CTGCTGGTGG CGGCAGTGCT 520 

GTTCATTACT GGAATTATCA TCCTCACTAG TGGGAAGTGT AGACAGTTCT CTCAGTTATG 580 

CCTGAATCGC CACAGGTGAG TGGGAGCCAG CACCCTGATG GGCACCCCAA CTGGAGCCGC 640 

CATACCATAC CAGTTCACCA CCCCTGCCTC CCTCCCTCTG CTCCAAGAGC CAACAGAGTG 700 

GTCAACATAA ATGGATCCTC AAAGGAAGAG GCCACCGGAG GGAGCCAGGC CTAAGGCTAA 760 

ATGGTCTTCC CACCCTGAGG AGAGAGGTCT CCCCAGGCAC TGCTGTGATC CTGCCTATCC 820 

TGTTCAGATA AATCCACATG GTCTCTCTTC AAAAA 855 

(2) INFORMATION FOR SEQ ID NO: 67: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 98 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 67: 

Ser Pro Arg Gly Ser Arg Arg Arg Gly Pro Leu Ala Ser Ser Thr Gin 
15 10 15 

Glu Pro Arg Met Ser Pro Pro Ser Gin Leu Cys Leu Leu Thr He Val 
20 25 30 

Ala Leu He Leu Pro Ser Glu Gly Gin Thr Pro Glu Lys Pro Arg Ser 
35 40 45 

Ser Phe Thr Ala His Gin Ser Ser Val Thr Thr His Val Pro Val Pro 
50 55 60 

Asp Gin Thr Ser Pro Gly Val Gin Thr Thr Pro Pro He Trp Thr Ser 
65 70 75 * 80 

Glu Ala Ala Lys Pro Gin Glu Ala Arg Gin Gin Pro Lys Pro Arg Pro 
85 90 95 

Ser Asn 

(2) INFORMATION FOR SEQ ID NO: 68: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4315 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 



(vii) IMMEDIATE SOURCE: 
(B) CLONE: HW075 



(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION: complement (3785.. 4014) 
(D) OTHER INFORMATION: /label= SAC_24466 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 68: 

GTCGACATGG CGAAGTACTC TAGCCTGGAA GCCAAGCTCT GCCAGGTTGA AAGTAAATAC 60 

TTGATACTGC TCCAAGAAAT GAAGACACCA GTTTGCTCAG AAGAACAGGG CCCTGCCAGG 120 

GATGTCATAG CCCAGTTGTT GGAGGACGCT CTGCAGGTTG AGAGCCAAGA GCAGCCAGAG 180 

CAAGCGTTTG TCAAGCCTCA TCTGGTCAGT GAGTTCGATA TCTACGGGTT TAGGACTGTC 240 

CCTGACGATG ATGAGGAAGA GAAATTGGTC GCCAAGGTCC GAGCATTGGA CCTGAAGACT 300 

CTGTACCTCA CAGAAAACCA GGAAGTTTCC ACTGGGGTCA AGTGGGAAAA CTATTTTGCA 360 

AGCACAATGA ACAGGGAGAT GGTGTGCTCT CCTGAGCTGA AAAACCTGAT CCGAGCAGGC 420 

ATTCCCCATG AGCACCGCTC CAAGGTGTGG AAGTGGTGTG TTGACCGTCA CACCAGGAAG 480 

TTCAAGGACA GCATGGAGCC AGGCTACTTC CAGGCCTTAC TCCAGAAGGC TCTAGAGAAA 540 

CAGAACCCGG CCTCCAAGCA GATTGAGCTG GACCTGCTTC GGACTCTGCC CAATAACAAA 600 

CATTACTCCA GCCCCACGTC AGAGGGCATA CAGAAGCTTC GCAATGTCCT GCTTGCCTTC 660 

TCATGGAGGA ATCCGGATAT TGGCTACTGC CAAGGCCTAA ACAGGTTGGT GGCAGTGGCG 720 

CTCCTTTACC TGGAACAAGA GGATGCTTTC TGGTGTCTCG TTACCATTGT GGAAGTCTTC 780 

ATGCCTCGAG ACTATTACAC AAAGACTCTA TTAGGATCCC AGGTGGACCA GCGGGTGTTC 840 

AGAGACCTCC TGAGTGAGAA GCTGCCTCGA CTGCACACCC ATTTTGAGCA GTACAAAGTG 900 

GACTACACCC TCATCACCTT CAACTGGTTT CTGGTGGTAT TCGTGGACAG CGTTGTCAGC 960 

GACATCCTCT TTAAGATATG GGACTCTTTC CTTTATGAGG GACCAAAGGT TATTTTCCGT 1020 

TTTGCCCTGG CACTTTTTAA ATACAAGGAA GAGGAGATCC TGAAGCTGCA GGATTCGATG 1080 

TCCATTTTCA AGTATCTCCG ATACTTCACT CGCACTATCC TTGATGCCAG GAAGCTGACC 1140 

AGCATCTCCT TTGGGGATCT GAACCCCTTC CCCCTGCGCC AGATCCGGAA CCGGAGAGCC 1200 
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TACCACTTGG AGAAGGTCCG GCTGGAGCTG ACAGAGCTGG AGGCCATTCG AGAGGACTTC 1260 

CTGCGTGAGC GGGACACTAG CCCTGACAAA GGCGAGCTGG TTAGCGATGA GGAGGAAGAC 1320 

ACTTGAATGA ATCCCCACCC CAGGATGCTG CCTTCCTTCT TGCCTTCCTT CATAACCAAA 1380 

AGTAGCAAAG GCGCATCTGC ACATCAGCTG TCAGAGTGAA ACATCTACAG CCTCTGGGCA 1440 

CCTGCGGGGC AGATGTCCAA AGCTAAGCTG CACTTTCTCG TCTGCCATGA GGGCAGGGGT 1500 

CTGATGGCTT TACAGCCATT TCCATGCCGT GGCATTCAGT CATATGCGTC ATATGCTTGC 1560 

TGCCTGATGG GTGAGTCACC AGGCGACTGT TTCCAAGGGC CTGTTTACAT CATCTGCCAG 1620 

TTGTATGTTT CACCTACAGA AACTGTTTGA CATTTCCCTG TGCGCTTTCT TTTTATCCAA 1680 

GGGCAACTCT GTGGAGGAGT GCACATGGAG CTGCAGCGTG GCTGCATCCA GGGGTAAAAG 1740 

GTCACCATAA GAAGAGGGAC CTCTTTAAAA AAAAGCGAGC ACAGAACTGG CCTCCCTAGA 1800 

GAGCGTGGGA GCCCTCTGCT GGTGGCTTCG CACTGCCAGC CACTATCCCT TCTGGACTAA 1860 

CCTTTGCCCT CCTATACAGC CTTTTCAGAT TGTAGCACTC ACGCTGCTCT GCAGCCAGCC 1920 

CCACACTTCT CATTGCCAGC TATTCTGCCA GATGTCTGTC CCAGCTTACA GTGGGGCTAG 1980 

CGCATCTGGG AGGTGACTTA TAGACAACAT TCTCCAACTG TGGCTAGATT GGCGTTCCAA 2040 

AGGAACCTTA CAGTAACCCA CCCCTGCATA AAGGAACACA GGAAACTCCT ATCTGTGGCT 2100 

TGTGGGCTCC TGGATGGCAG TGGCCAGGGC TCTAGAATCT CTTCTGCCCA ACAGCTGGGT 2160 

GGGGAGTAGA GCTGACTGCC AGCTTCCTAC ATCTGGAAGC ACCTTTGCTT GGGAACAAGA 2220 

GTAATGAGTT CTCCTAGTCT CTGTGTTGCC TGGATCATCC TTGTCCTTAA CAGGGATCTG 2280 

AGGACACTGT GGCCTCTGGT ATAGCAGACT CTGGGGACTT GGAACACTTC CTTCTTGAAG 2340 

CCCAGAGAGG GTCTGAACAA AAAAGGTTAT TTTTGCTCAG TCCCAGAGCA TCCCTAATGG 2400 

TGGCTGTTAT GAGAAAACAT CTTCTTACAC AGTATTATGG GAACACAAGG GAAACAGCCT 2460 

CAAGTGGACT ACTTTACACT ACCTAGGGAA AAGAAATTAG AAACAATCCT TAGTGTAAAC 2520 

ACACTCAGAC TGAGGAGTGG AGCAAGACAG TTTCGAGAAA CGCTTTTTCC ATGACCCGTT 2580 

TTCCTGTTTT TGCTGTCTGC TTTCTAGCAC TTGAATGGCT GAGTCCATTT TCAGCATCTA 2640 

AATTTGCTTC ACTTCTCTAA AGTTGCTTCA TATATTACCA TGTAAAAGGA GCTGGTCTCT 2700 

ATATTTAGCC AATAGCCCGT CTCTTTTCAG GTATCAGAAA TGTCATCCTT GTGTCTTTGA 2760 

GATGATGGTT TGAGGCCCGG CTTCTTGAAG AAGCTGTCTG GAGCTGGACA GTGATCTATC 2820 

TCACCACTGT TCCTAGGCCT CCCTGAGACC AAATGCTGGA AGATGCCCCA CCCCCTGTAA 2880 

GATGCCGGCT GTCAGCCTTG GGGACAGCTG TTAGGAGGTT AGAGTTGGAG CCCCATGGCT 2940 

TAGCTCCAGC CGCTCTGATT TCGGTTGTCA TGTCAATTTC CTACCAGAAG ATCCAGTCCA 3000 

AATAAGTATT TGGTGTGGAT GCTCTCAGTG GAGCATTTGG TGTGGATGCT CTCAGTGGTG 3060 

GGTTGCTTTC GCCCTGGCAC CAGTGGAGGC ATCTTTAAGT ACTTGGTGAT GCTGAGGACA 3120 
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GGACCATGTT 


CCTGGCATGC 


GGGGCCCTGG 


GTTCCATCCC 


AGTGATGAAA 


AAAAATCTTC 


3180 


AGTATGTGGG 


GATTGTGGGG 


AGGCGCTGTT 


CTGCCATCCC 


CTGAGAGACA 


GCATTAGTTT 


3240 


CTCAGGCAGT 


TAGTATCAGT 


GCCCAGGTAA 


AAGTCTCAAG 


AGGTCCCAAA 


GGGACTCTCC 


3300 


TCCCTGGAAT 


AAATGACTCG 


ATGTCTGCAG 


AGCGGAGAAG 


GGACGGTTGC 


CCTCTTTCCA 


3360 


GTTTGGGGAC 


AGAATAACAG 


AGATTCATCC 


CAGAGGCACA 


GCCCCAGTGG 


TTGAGCCATC 


3420 


GTCTCTCATG 


CCCTCGTCAC 


TGAGATTGTC 


TTTACCCTCC 


AGTACTCCTG 


AGACTCCAGA 


3480 


CTTCTACCAG 


ACACCATGGC 


AGGTACCCCT 


AAGGTCAGTG 


GCTGCCTCAG 


GCTGCTTTTC 


3540 


ATCTAAACCT 


TTACCCAGCA 


TGCAACCCAT 


GGCTCCTGCC 


TGGTGAGCCT 


TGTGGGCTGA 


3600 


GAAACCCAAA 


GCACAGCCTC 


CAGCGCGTGG 


GTCCAAATGC 


GTGTGCCACA 


GTAGGGGAGC 


3660 


ACCCTGCCCT 


CAGCACCCCA 


TATGGTGGGA 


CTTGCCAGTA 


TTTCAACTGT 


TGATACTTTT 


3720 


ATATTTTCAC 


TACATTTTAT 


ATTTTTATAG 


ACTATTTTAT 


ATTAAGGTTT 


TTTCTAGATT 


3780 


CTGTACAACT 


TTTATATAAC 


AAGTTCTATT 


TTGTGTGCAC 


ATCTCCCCTG 


TATGTATTTA 


3840 


TAGCAACCCA 


AGCCTCTCAA 


ATTCCAGGTC 


CCTCTTGTGC 


CACCTTTTCC 


CCTTAGCCTT 


3900 


GACTGTCACT 


CTCTTCTGCT 


CACCGAGGTA 


GCCTCGGGGG 


CTACTTCTAG 


CCACCATGAC 


3960 


AGCCAGACAG 


CAGGAACTGG 


TGTTTGTCAG 


CTGTGGTCTG 


TTGATACAGG 


AAGTACTCTG 


4020 


GGCAGTAACC 


CAGTAGCTAT 


GTTACTGCAC 


AAGCCCTTCT 


CGCTTAGTGC 


AGTGATGTCC 


4080 


TGGTGCAGCC 


ACCTGTGGGC 


GGGTGGCTGG 


ACCCATCACG 


AAAGTCCTTG 


AAGACAGAGG 


4140 


CTAGATGAAA 


GCCCAGTGTC 


TGAGGTCCTT 


CTGTGGGCAA 


CTGGAGGACT 


TGAGGGCACA 


4200 


CTGCCAAGTG 


ACTGTCCTTT 


TATATCCTAG 


CAGCTTCTGT 


GCTCACCCTT 


GAGTTAAATG 


4260 


TTAGACTTTT 


AAAGGAAGAA 


ATAAACTAAT 


TTGTATTACA 


TTCTGCCTCG 


AAAAA 


4315 



(2) INFORMATION FOR SEQ ID NO: 69: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1668 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 



(vii) IMMEDIATE SOURCE: 
(B) CLONE: HW076 

(ix) FEATURE: 

(A) NAME /KEY: misc_f eature 

(B) LOCATION: complement (42.. 2 81) 

(D) OTHER INFORMATION: /label= SAC_24409 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 69: 

GTCGACTTTT GCCTTTAAAA TTCTGTTTAC TTAGCATATG TACACGTAAA AGAGAAATCA 60 

CCCTTAGATC CCACCCAGCA GAAAACATGC ACAAACCCAA GGTATATGTG AAAGGGAGGG 120 

CCCCAGCCCC AAGTTAGCAG TTGGGATGGT GCAATCCTGT GAGTCACAAG CCCATGGCAA 180 

GAGGGCTGGA CTAAAGTTTC AACACAGCCA GAACCCCTGC TGCTGGCCCT AAAATAAGCG 240 

TGCACCATAG CCAAAGGTCT GTTTGATGCC CTCAAAGTAG TACCGCTGCC AGCCTTGCCG 300 

TGTCCGCTCT TCCTCGGGAG CAGGGATGCC GCGGCCCTCC ATGCACAGCT CTGTCTCTCC 360 

ATTCTTATCA ATGAAGGTCA AGGTGATGGT CGCGAAGTGC CCCTCTGGCC ATGACTTAAA 420 

CCTCCACTTC ATAGCAATAT GTTTCTCAGG AACCAAGTCA GTAAACTCCC CAGTGACGTT 480 

ACCATCGACC ATGTGAAACT TCCCTCCTCT GTCTGCTTCC AAGGCCGCAG GAGCATGGGT 540 

AAAGGCCTGG ACCAGCTCCT GCGTGGTGAA CACTCTGTAG AGCTCCTCTG GAGAGGTCAG 600 

GAAGGTTTCT TTGAGGGTGA TCTTACAAGT GGGTATTTTG ACACCAACAG GTTTGGCCTG 660 

GCTTTTTGAA GGAGCACACT TAGCCTTGCA CACTTCCGTT TTTAGTGCTG GTTGGCCCAC 720 

TGGGTCTACT GATTCTCCAT TCACTGTGGG CAAGATCATG CCCTGAGTAA ACTCTGTCTT 780 

GAGGGTGCTG ATGTAAATTC CCACTGCTTC TCTGAGAAGT TTCACCCCGT CTTCCTTCAT 840 

TAAGGCCACG AGACTTGTGT CAGGCTCATC TTTGGCAAGG CTGCTTCTTA GAGCCCTTGT 900 

TATGGCAGCG CTCCATCTAG GAAACAGCAG TCAGTCTCTC CTGCTCACTG TTCTCCCTCT 960 

CTGTGCTCAA GTCTGTCAGT GCTCGCTGGC CTCTGGCTCT TCCCTTTGGG GAGGGAGCGG 1020 

CTGTTCTGCC TGCTTCACCG AGTCTTCACA AGCTGAGGGG ATGAGGCCAG GTTAGTACCA 1080 

GGGACCAGCA ACCTTTCAGG GCTCCTTGGG GGTCTCAGGT CCACACACAG TGAGCATCCC 1140 

TTGCCAGGTG CCCCATGTAT CTGTCAGTGC TGAGACGCCA AACACACAAC CTGTTCCCAG 1200 

CTGCTGCTGG GGGCCACTGT GACCGAGGAG AGTGTGGGGT TGCCAGGGAG AGCGCTGTCC 1260 

CTCTGACAGC CTGACCAGAT GATGTAGCTT TGAAAGCCGG CAGCATGTGA GTAGCAAGAT 1320 

GTGACCTTGA GCTAGCTGAG CAGAACACAG GTGGATGGGC AGAGGCCCCC ATCCTGGCGA 1380 

GCTGAGCCTG GCTGTGTCCT GTCAGGACGC CCTACTCTGT GACCCTACCC TGTGATCTGT 1440 

GAAGTGGTGC AGTCGGAAGA CGAGGCTGGA GTCCTTTACA AGTTTCTACA CTAGATTGTA 1500 

GCCAGGTTTT CTCCCAGGCC TTAGGCTTTG GTTTACTTTT AATTTTAATT TCACTGTTAC 1560 

CCTTGACTAT TGTCCTTTTT GTTGCAGTGT TGGAGAGGCA GGAGGTATTT GTGCCTCATC 1620 

TATTACTGCA AAAAAATTAA ATAAATGTAA CAATAAACGT CCTAAAAA 1668 
(2) INFORMATION FOR SEQ ID NO: 70: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1019 base pairs 

(B) TYPE: nucleic acid 
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(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 



(vii) IMMEDIATE SOURCE: 
(B) CLONE: HW078 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION: complement (211.. 743) 

(D) OTHER INFORMATION: /label= SAC_24345 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 70: 



GTCGACGCCA 


GCGTGAGCGC 


CCTGCAGCGC 


CTGGTGGAGC 


AGCTGAAGCT 


GGAGGCCGGC 


60 


GTGGAGAGGA 


TCAAGGTCTC 


TCAGGCAGCT 


GCAGAGCTTC 


AGCAGTACTG 


CATACAGAAT 


120 


GCATGCAAGG 


ACGCCCTGCT 


GCTTGGTGTT 


CCGGCTGGAA 


GCAACCCCTT 


CCGGGAGCCC 


180 


AGGTCCTGTG 


CTTTACTTTG 


AAGACTGGGA 


AGAAGCTCTC 


TGAGGAACCT 


GTGTGTGCAT 


240 


GGTGATGGAT 


GACTACCGCC 


AAGTCCCAAG 


AAAACAGATT 


TCAAGAGCCA 


AATCATTTTC 


300 


TCTGTGGCCT 


GGTCCTAGGT 


CAAAATTCTA 


CAGAAATCAA 


TGTTTTGTCT 


TATTTTTTAA 


360 


CCTCATTACC 


AAACTCACTG 


GGTGCATCTC 


TCAGTCAAGC 


CTGTGCGGCC 


CTGCCTGGGA 


420 


TGGAGGCATG 


TTTTGTTTAC 


TGGGTATTTG 


CATTTCTGGA 


TGGAAATGTT 


GTATTTCCGA 


480 


AAACTGAGTA 


TTTGTAATTT 


TCCTAACATT 


TTTATACTGT 


AATAAAAAAT 


TTTCATAAAA 


540 


ATTTACAGTT 


ACTAAATTAG 


ACAAAGTATA 


AATTGGGGAA 


TGATTGATAT 


GCTGGTTTCG 


600 


TAAAGGAGGC 


CTTTCCAGTC 


CCAGTGAGCA 


AACGGTCTGA 


CCTGGACGGG 


AAGCTGGGGA 


660 


GCCGCTCCTT 


AGCACACGGA 


GCATGGATGT 


GTTGTTAGGT 


GGCCCTCCCG 


TGTGAAGCCA 


720 


TTAGAACTTT 


CTTTAGTTCC 


TGTACTCTAC 


AACTTATAAT 


TGTGTCTGAT 


GCTTTGAAAA 


780 


GTGTGCTCAT 


GCCTTTAAAA 


ATGTTTAAAT 


GTGATCTACA 


CTTAGAACGT 


TGCAATTGGT 


840 


TAAAAACAAA 


ATATGAAGAT 


ATTATAACCT 


AAAAGAATTC 


ATGTCACACA 


TATTTTATTG 


900 


GGGGTGATGT 


GCCTTTGATT 


TAATTGGGGA 


CACTTTTAGA 


AGGAAAGTTT 


GTGTTTGTGA 


960 


TTATCTTTGA 


AGAGCTTGGA 


AATAAAATTT 


TTGCCTAATT 


CATCATTTTC 


CATGAAAAA 


1019 



(2) INFORMATION FOR SEQ ID NO: 71: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 961 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 



(vii) IMMEDIATE SOURCE: 
(B) CLONE: HW080 

(ix) FEATURE: 

(A) NAME/KEY: misc_f eature 

(B) LOCATION: 629.. 919 

(D) OTHER INFORMATION: /label- SAC_24033 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 71: 



GTCGACGCTG 


GGTAAAAGAA 


Wai l^UALCA GAGCCTGTGG GTTTGTCCTT AGGCTAGAGG 


60 


CCCCAAGATT 


TGGAACCTC5fl 


i\i<jAAi\.AUT AGGAGGGACA TTGAAGCTCA CAGATATATC 


120 


ACACGTTGGT 


CAGTTOPTTP 

wTW X X VV X X W 


i\3UAALi\?iji CCGTTAGGTG CTGTGGGACG AGCCCTTTCC 


180 




nuu x i V— intn 


TCCACTGTCT GCCTGTGTCT CAGTTTACAT CTTCCTTTTG 


240 


AATATGGAAT 


CTCAAATAGC 


CCAGGCTAGC TTCAAATTTG TTAAGTAGTT GAGGACAACC 


300 


TCAATCTTTC 


TATTCGCTGC 


CTCCACCTCT CTCAGTTTAC CTTTTTTTTT TTTCTCCTTA 


360 


GGATTAAGTC 


CCGTACAGGC 


CCAGAGTGGT AAGCCATAAC ACCCCTGGTC TTTCTCTCTT 


420 


CCTCTCAAGA 


TTTCCTCAGG 


CTACCCCTTT TCCTTCTAGC TCTCTCTCTC TGCTAACGCC 


480 


GAGCCCTGAT 


TGTTAACCTG 


TGTCTCCCTC TTCATCCTTC TGAGACAATT ACCCAGGATG 


540 


CGAATGTTCC 


TCTGTGAGCC 


CGGGTGTACT GGCTGGGATT GTGCTGGGCG ACTTGGTGCT 


600 


GACTCTGCTC 


ATCGCCCTGG 


CGGTGTACTC TCTGGGCCGC CTGGTCTCTC GAGGCCGAGG 


660 


GACTGCAGAC 


GGGACCCGGA 


AACAGCACAT GGCTGAGACT GAGTCACCTT ATCAGGAGCT 


720 


TCAGGGTCAG 


AGGCCAGAAG 


TATACAGTGA CCTCAACACA CAGAGGCAGT ATTACAGATG 


780 


AACCCACCCT 


ATGCCCACCA 


ACAACCTGAT GCCCGGATCC ACTCATTCCA GACGCTTACT 


840 


CAACAAACCC 


TCCCTGGGAT 


CAGGACTCCC GCTGGAATAA ATATCCACAG AGTGCCCTCT 


900 


GGGAGATATC 


TGACCTTGTA 


CCGTTTCTGT CCCCAAATAA AAGACGAAGC AAAAACAAAA 


960 


A 






961 



(2) INFORMATION FOR SEQ ID NO: 72: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1890 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
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(vii) IMMEDIATE SOURCE: 
(B) CLONE: HW082 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 150.. 569 

(ix) FEATURE: 

(A) NAME/KEY: raisc_f eature 

(B) LOCATION: 1381.. 1848 

(D) OTHER INFORMATION: /label= SAC_24469 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 72: 

GTCGACCGCG GCCGCGTCGA CGGACTTGCG GTTCCTTCAG CATTCTGCCC GCGCACTGAG 60 

TGGCTGTTTC CTGGAGCTAC TGTGTTCCTT CGGGAGTGCA GCAGATCCCG TTCAGAACCT 120 

ATTACCAAGA GCCCCGAGCA GGACCCACC ATG CAG TGC TTC AAA TTC ATT AAG 173 

Met Gin Cys Phe Lys Phe He Lys 
1 5 

GTC ATG ATG ATC CTC TTC AAT CTA CTC ATC TTT CTC TGT GGT GCA GCC 221 
Val Met Met He Leu Phe Asn Leu Leu He Phe Leu Cys Gly Ala Ala 
10 15 20 

CTG TTG GCT GTG GGA ATC TGG GTG TCC GTC GAC GGG ACA TCT TTC CTG 269 
Leu Leu Ala Val Gly He Trp Val Ser Val Asp Gly Thr Ser Phe Leu 
25 30 35 40 

AAG GCC TTC GGA TCA CTA TCA TCG AGT GCC ATG CAG TTT GTC AAC GTG 317 
Lys Ala Phe Gly Ser Leu Ser Ser Ser Ala Met Gin Phe Val Asn Val 
45 50 55 

GGC TAC TTC CTC ATC GCC GCT GGT GCT GTA CTC TTT ATT CTT GGT TTC 365 
Gly Tyr Phe Leu He Ala Ala Gly Ala Val Leu Phe He Leu Gly Phe 
60 65 70 

CTG GGC TGC TAT GGT GCT CAC TCC GAG AAC AAG TGT GTG CTC ATG ATG 413 
Leu Gly Cys Tyr Gly Ala His Ser Glu Asn Lys Cys Val Leu Met Met 
75 80 85 

GTG TGT TGG ACC CAG CTC CCC AGG CCC ACA ACC AAG GGT TCC CCC TTC 461 
Val Cys Trp Thr Gin Leu Pro Arg Pro Thr Thr Lys Gly Ser Pro Phe 
90 95 100 

ATC CCT GAC ACC AGC CTC TGG GAT CCC AAA ACC CTA CTC TGG ATT CTG 509 
He Pro Asp Thr Ser Leu Trp Asp Pro Lys Thr Leu Leu Trp He Leu 
105 HO 115 120 

GAA GCC TCG CTG TGG GGA CCC CCT GCC TCT TAT GGG AGT GTG CAG GAA 557 
Glu Ala Ser Leu Trp Gly Pro Pro Ala Ser Tyr Gly Ser Val Gin Glu 
125 130 135 

GGT GTG TAT CCC TAAAACGTGT TTCCTTTCTC CACTCAGTTC TTTTCCATTC 609 
Gly Val Tyr Pro 
140 

TCCTCATCAT CTTCATTGCG GAGATTGCAG GGGCCGTGGT TGCTTTGGTG TACACCACAA 669 
TGGTGAGGCA CGGGATGGGG CATGGGAGGA AAACTGGAGC AAAGCTCTGA GAGTTGGTGG 729 
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TGGCCTGTGA ATGGCTACTT CCCCTACCAG TCTATTAATA CTGGCCTGAC CCGTCCAGGC 789 

TGAACAATTC CTGACATTTC TGGTGGTGCC TGCTATCGAA AAGGACTATG GTTACCAGAC 849 

TGAATTCACC CAAGTGTGGA ACTCTACGAT GGAGGGGGTA AGGTGGGCCG GGAGGATTTG 909 

GGAATGGAGA GGAGAAAAAA ATCAAGGCAC CATCAGTTAC CAACCCCATT TTGTCTGCAG 969 

CTGCATTGCT GTGGCTTCAA CAATTACACA GATTTTAACT CCTCACGTTT CGTCAAAGAG 1029 

AACAAAGTCT TTCCTCCGTT CTGTTGCGCC AACAACACTG ACAGTCATAC AGTTGAACCA 1089 

TGCACCGAGG ACAAGGCCAA AAGTATGAAC GTACAGGTAT GTGCTGGTGT GGAATTAGCT 1149 

TTTTCGTTTG TGATCCGGTT TCTCTGTGTA GCTGTGGATG TCCTGGAACT CGATATGTAG 1209 

ACCAGGCTGG GCTTGAACAG AGATCGACCT GCCTCTGCCT CCTGAGTGCT GGGGTTAAAG 1269 

GCCACCATCA CCACCCAGCG CCAGTGTAAT TGCTTTAATG GCTTTACAGT AAGGAGTAAA 1329 

GGGGTTGGAG GTAGGGCACT GCCAGCTATA AATACTTTCT TCTCTCTCCC AAGGGTTGTT 1389 

TCAAACAGAT TCTGCAGAAA ATCAGAACCA ATGCAGTTAC CGTGGGTGGT GTGGCAGTTG 1449 

GAGTTGCGGC CCTAGAGCTG GCTGCCATGG TGGTATCCAT GTATCTATAC TGCAATCTGA 1509 

AGTAAGACTG CTTCTTCCTC CTGACTTGTC GTCACATGGA AACCAGGAAG AGGCACTTGG 1569 

CACAGCTAAG CAGCAGCAAT GGGTGGAGAT GAAACGTCAT TTGGGCTGGA GTGGATCTGC 1629 

CTTTGCTGCT CTGGACTTCG GGCTAATTAG GGACCACCGC TCGGGCTGTG TCTGTCTTGT 1689 

TCTTCCACTA CTGGGGCGAG GGGGGCATTG GTCCCAGTCT CTGAGGTAGC CATGTCTGCC 1749 

CATCTCCTGA GATTCTTGAT GATCCCAGCA CTCCACTGGT GGATGAAGGC ACTTATAGCC 1809 

TGGGCATATT CTAGCACTGC AAAGTCATAG AAGGAATGTA CAGAAGGTGT TTTGGAACTT 1869 

ATAAACCAAT GAAAATAAAA A 189Q 

(2) INFORMATION FOR SEQ ID NO: 73: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 140 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 73: 



Leu lie Phe Leu Cys Gly Ala Ala Leu Leu Ala Val Gly lie Tro Val 
20 25 30 

Ser Val Asp Gly Thr Ser Phe Leu Lys Ala Phe Gly Ser Leu Ser Ser 
35 40 45 

Ser Ala Met Gin Phe Val Asn Val Gly Tyr Phe Leu He Ala Ala Gly 
50 55 60 
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Ala Val Leu Phe lie Leu Gly Phe Leu Gly Cys Tyr Gly Ala His Ser 
65 70 75 80 

Glu Asn Lys Cys Val Leu Met Met Val Cys Trp Thr Gin Leu Pro Arg 
85 90 95 

Pro Thr Thr Lys Gly Ser Pro Phe He Pro Asp Thr Ser Leu Trp Asp 
100 105 110 

Pro Lys Thr Leu Leu Trp He Leu Glu Ala Ser Leu Trp Gly Pro Pro 
115 120 125 

Ala Ser Tyr Gly Ser Val Gin Glu Gly Val Tyr Pro 
130 135 140 

(2) INFORMATION FOR SEQ ID NO: 74: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2181 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iv) ANTI- SENSE: NO 



(vii) IMMEDIATE SOURCE: 
(B) CLONE: HW083 

(ix) FEATURE: 

(A) NAME/ KEY: misc_feature 

(B) LOCATION: complement (1950.. 2181) 

(D) OTHER INFORMATION: /label= SAC_24528 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 74: 



GTCGACCCAC AATCGCCCGA 


GCACTTGGAA 


TCACAGCAGA 


AACGACCCAA 


TCCTGAGCTC 


60 


CGTAGAAATG TGACCATCAA 


AACTGAACTC 


CCACTAGGCG 


CACGGCGAAA 


GATGAAGCCA 


120 


CTGCTCCCAC GGGTTAGCTC 


ATACCTGGTG 


CCCATCCAGT 


TCCCGGTGAA 


CCAGTCCCTG 


180 


GTGTTACAGC CCTCGGTGAA 


GGTTCCCTTG 


CCTCTGGCAG 


CATCTCTTAT 


GAGCTCAGAG 


240 


CTTGCCCGTC ATAGCAAGCG AGTCCGCATT 


GCACCCAAGG 


TGCTGCTATC 


CAACGAAGGG 


300 


ATAGCCCCAC TTCCTGCCAC 


AGAACCCATG 


AAGGAGGAGA 


AACCCCTGCT 


TGGAGAAGGG 


360 


CTATTGCCTT TGCTTCCTAT 


TCAGTCCATT 


AAGGAAGAAG 


TAATTCAGCC 


TGGGGAGGAC 


420 


ATACCACACT TAGAGAGGCC 


TATCAAAGTG 


GAGAGCCCTC 


CCTTGGAAGA 


GTGGCCCTCT 


480 


CCGTGTGCAT CAGTGAAAGA 


GGAACTGTCC 


AACTCCTGGG 


AAGATTCTTC 


CTGCTCTCCT 


540 


ACCCCAAAGC CCAAGAAGTC 


CTATTGTGGG 


CTTAAGTCCC 


CAACACGGTG 


TGTCTCAGAA 


600 


ATGCTGGTGA CAAAGCGGAG 


AGAGAAGAGA 


GAGGTGAGCC 


GATCTCGGAG 


GAAGCAGCAC 


660 
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CTTCAGCCAC 


CCTGTCTAGA 


TGAGCCTGAA 


CTCTTCTTCT CAGAGGACTC CAGCACATTT 


720 


CGGCCAGCCA 


TGGAGATCCT 


GGCAGAGTCT 


TCAGAGCCTG CACCACAGCT CAGCTGCCCT 


780 


CAGGAGGAGG 


GAGGGCCCTT 


CAAGACCCCC 


ATCAAGGAGA CATTGCCTGT CTCCTCCACT 


840 


CCTAGCAAGT 


CTGTGCTCTC 


TAGAGACCCT 


GAGTCCTGGA GGCTCACACC CCCAGCCAAA 


900 


GTTGGGGGGT 


TAGATTTCAG 


CCCAGTACGA 


ACCCCCCAGG GTGCCTTTGG CCCTCTGCCT 


960 


GACTCGCTGG 


GGCTTATGGA 


GCTGAATACC 


ACACCTCTGA AAAGTGTTCC CCTCTTCGAC 


1020 


TCACCCCGGG 


AGCTCCTTAA 


CTCAGAAGCC 


TTTGACCTTG CCTCTGATCC CTTTAGCAGT 


1080 


TCTCCACCAC 


CACATTTGGA 


AGCCAAGCCA 


GGCTCCCCCG AGCTGCAGGT CCCCAGCCTT 


1140 


TCAGCCAACC 


GTTCTCTCAC 


AGAAGGCCTT 


GTCCTGGACA CAATGAATGA TAGCCTCAGC 


1200 


AAGATCCTTC 


TAGACATCAG 


TTTCCCTGGC 


CTGGAGGAGG ACCCTCTGGG CCCTGACAAC 


1260 


ATCAACTGGT 


CTCAGTTCAT 


CCCTGAGCTG 


CGATAGAGGC AGGGTCTTAC CCTTGCCACT 


1320 


CAAGCCACCA 


GTTATCCTGG 


CACTTGTGTG 


GCTGGATAGT GCAAGGCTCA GTGTACCCCA 


1380 


AACCGTCTGA 


GGGAGCTAGC 


AGGCAAGGGC 


TGAGCGGTGC CCTTTGACCT AATTATGCCA 


1440 


AGGTAAAAGC 


CACGTCTAAG 


CCACTGCTGG 


GACCTATGCA AGCAATAGGA TCTCCCAGAG 


1500 


TCCTCCACTC 


CCTGCTGGCA 


AGTGAAGTGG 


GTGTGACAGA GCCGTGAGGA CCAGGAAATG 


1560 


CCCACCCATT 


AGTCACCTGC 


TGCTCCTGGC 


AGGATAACCC TTGTAAATGG TGTCAGTTCC 


1620 


CCAAGTTGTC 


CTGTAATTAT 


AAATGTAGCC 


ATATTCCCTT AGCTCTCATT ATCCAGAGAC 


1680 


TGCCAGGATG 


GGTAGGGTGA 


CAAGGGGTTG 


CATTAGCTTC TGCTTGTGGC CTTTGGGGGC 


1740 


AGGACCTGCA 


GTTCAGCCTC 


TTCACACTGT 


GGGTTCTGCT GTAGGCTTCT AGACACACAG 


1800 


GTGTCCTTGC 


CAGGACCCCA 


CTTACTGCCC 


TTTCCTCACA GCTCCCCCTG GTTCTAAGCC 


1860 


AGTGGTACTG 


CATGAAGAAA 


TCCTGCGGCA 


AAGCCTATTG GCTCTGGGTG TGTGGGGACG 


1920 


GGTGTGCCTG 


AAGCAAAAGC 


ATGGGTACTC 


ACGTGAGTCC TTTAGGTGTT TCTCTGATCG 


1980 


TGTTCCCAAT 


CATGCCAGGG 


AGTCTAGCAT 


TGAGAACTCA GGCTGAGGCC TGAGGAGGAG 


2040 


GAGGAAGTGA 


CCACTGACTT 


GCCTGGCTTC 


CTTAGCTTGC ACCTGAGTTT TGCAAAAAGC 


2100 


CACCCTAGAC 


CCCACTCTAC 


AAGCTAGCAC 


AAGAACACTA CTGTAACTAC CTACTGAATA 


2160 


AAGCCCAGGT 


GGCCTGAAAA 


A 




2181 


(2) INFORMATION FOR SEQ ID NO: 75: 







(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3336 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
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(iii) HYPOTHETICAL: NO 



(vii) IMMEDIATE SOURCE: 
(B) CLONE: HW084 

(ix) FEATURE: 

(A) NAME/KEY: mis cofeature 

(B) LOCATION: complement (163.. 391) 

(D) OTHER INFORMATION: /label= SAC 24854 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:75: 



TTTTTCCACC 


AAGAGGGAGT 


CCTTTAATAG 


ATTGCACAGA 


CAGACACAAC 


AGGCCAGCAC 


60 


CCTGCTTGCA 


GGCCCAGGCG 


CTTTGCCAGC 


AGATGCAATG 


GCCTCCCCCA 


CCTGGGTGTG 


120 


GGCCGCAGGG 


TGGAGATGAG 


GTGGTTTAAA 


AAACACAGCT 


GTACTAATTC 


TTCACTTTCA 


180 


CAGAAAAGA6 


AAACTTGGGG 


GAGCAGCTCA 


CAGGAGAATT 


TACATAGAGC 


TGAGTCTGAA 


240 


TGTCCACCCC 


TAGAGAAGTT 


GAACCTCCCC 


CTGAGCTGTC 


TGTGCTCAGC 


AGGGGGCAGA 


300 


CAGCACATGG 


AGGGCGCAGG 


CCAGTCCACA 


GAGGAGCGCT 


GACTTAAACA 


GCTGTCCTCC 


360 


CTTCCCAGNT 


GTTAGNATTT 


CTACACNATG 


TACATTGTTG 


TGCTTTATTG 


CTCCCACCCC 


420 


AGCCTCTGCC 


TGAAGCCCAG 


GGTGGAGGGA 


GGAACACCTG 


AACCAGATTC 


AGCTGGGTGG 


480 


GAGCACCAAG 


CCCCCAATGG 


CTGGTGCTCT 


GTAAGNTTCT 


GAGGCTGCCA 


GGCTCNNGNC 


540 


ACTCGGAGCT 


GCAGACCAAG 


GGTACAGAGC 


CAGCCGACCA 


GCCAGCTCTT 


CTGAGGAGCA 


600 


GCAGAAALAG 


AAGGATCAGT 


GCTGGACAAG 


ATGCAGGCTG 


GGTAGGAGCA 


GGCAGGGTGG 


660 




wLAAWjCCAT 


VaAGTCTGGGG 


CAGAGGGTTG 


TGCACATGGG 


GCTGTGGGGG 


720 


V-.V- 1 \j wioLrVrL. 


1 L. AL.w4.V3 


AU A\j AAL.L. A I 


1VTGTAGCAG 


AGC TTGACCtG 


AGGAAGGGTG 


*7 O A 

780 


TGAGGACCAC 


TGGCCTGGGG 


CCCCTTGGAG 


CACAGAGCTG 


CTTCCAGTTC 


ATCAGGGGCT 


840 


GGCAGTGGCA 


GGCCCCAGCT 


CGGGGCGTGT 


GCATTGGGAG 


ATTCAGGGAG 


AACCNGCAGG 


900 


CAGCCTGTGG 


GGCTGGGTGA 


GGAGGGAAGG 


CTGTGGTGGT 


ATCTTTGAGT 


NTGTCTTGGC 


960 


CGGGCCAGGG 


GTTAACACAG 


GATCAGTTGT 


TCTCAAAGAG 


TGAGAGCAGA 


TCATCACTGC 


1020 


TGTTTGTGGG 


GAGGTCAGGT 


GGACCCAGGT 


AGGAGAGTAG 


TTCATCAGGA 


TTGGTCAGTT 


1080 


CTGGGAGCAG 


ANCNAGAGCT 


GGTTCTGAGG 


CCTCCCCTGC 


CCCAGTCATG 


CTGGGAGGCC 


1140 


CCATCATGCC 


CGTGGCAGNA 


TTGAAAGCCA 


GTTCACTGAT 


GGGCCCTGTG 


TTTGGTTGGC 


1200 


CCAGGGGCTG 


CCTGGATGCA 


GGGGAAGGGT 


TTGGATGGTG 


TAGCTGGGTG 


CCTGGGGTGG 


1260 


GCCCGAGGTT 


GGGAGTGTGC 


AGTCCTGGTG 


GTCCAGGATT 


ATGGGCTGGG 


TCCCAGGTGA 


1320 


CCCTGCCTGG 


TGCCCATCCT 


GGCCTGGGAG 


GCAAGGAGCA 


GACTTGNNNN 


GAGTCAGGAG 


1380 


GCTGCTGGGA 


ATGTCAGACT 


GGTAGGAGAT 


TGGGGGTGGC 


CCCTGGGGTG 


AACTCAGCAA 


1440 


GGTTTGGGGT 


GGCGGGTGTA 


GCAGATGGGA 


AAGATTCTGG 


GAAGGTTCCA 


GGCCCCCAGG 


1500 
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AAGTTGGAAC 


CCTGGCTGGG 


NTAGTCGNTG 


GGGGTAGGGN 


CCNAAGGGGC 


TGCAATGGGG 


1560 


CAAANGGGCA 


GCGCCAGGGC 


CCAGGGCCGC 


AATCATCTCC 


ATCACACTGG 


GCATGAGCAC 


1620 


GTGGGCAGGG 


CTCACAGTAC 


GGCAGCGCTT 


CAGCACTGGG 


CCCGTCCGGC 


TCCTCCTTGA 


1680 


TGTGTAGGTC 


AGGCTTCACA 


GGCACTGGTT 


TCCAGCTGCA 


TGTGGGGTCG ATGGTGATCT 


1740 


CCTCGTAGTC 


TGAGTTCTGA 


ATATAAATCA 


GGATGCCTAG 


CATATACTGA TCCACCTCCA 


1800 


GGCCCTCCAG 


CAATGCCGTC 


TTGTTGCACA 


CCGGGACACC 


TCCAGGTCCC 


CCGCTCACAG 


1860 


TTAAGCTGTA 


AGTATGACTC 


CAGGTCAAAG 


CACTGTATGT 


GGCGACAGTC 


GTGACCACGA 


1920 


GCTGGGAGCT 


GGATCCTGCG 


GAAGGTGATG 


GGGCACTTCA GGGACACCTT GATAGCCGTC 


1980 


TGCTCCACCC 


CATCCTCTCC 


ATTGGGCCCA 


GGGGTGCCAG 


GGATGGTGCC 


GCTACTGAAG 


2040 


TTCCGCTTTA 


TCTTGGTGAT 


GCAGTGCTCA 


GCTGGCAAGN 


GGCGCTTCTT 


GAGGAGACCC 


2100 


TGCAGCACAG 


AGCGGACAGA 


CGGACGGTGC 


ACCAGCTGCA 


GCACGAAGAG 


GTGGGAGCAG 


2160 


CAGCAGGCGG 


TGACGGTGAT 


CTGGATGGTG 


TTGCGACCCG 


GCTGGCACAC ATGCTTCAGG 


2220 


TAGAGGGGCT 


TGTGCGAGGT 


CTTGTTGTCT 


CCACGCTCGA 


TGCTGAGGGG 


TGTGNCATTG 


2280 


ACGCTGACCT 


GCACCGNNGC 


CGGCCANTTG 


GTGGTTCATC 


TGCCGGTCCT 


CGTGGTGGTA 


2340 


GCACTTGAAC 


TGCGAACTGA 


GCTCTAAGTC 


AGGTCTCAGC 


ATCAGGGTTT 


TATACACAGA 


2400 


GTCTCGNAGC 


TGGAAGACAT 


GGTTGCTCAC 


AGCCAGGTTG 


TGCTGCAGGC 


GGAAGGGCTC 


2460 


CAGGACCACC 


CCATCCCGAA 


CTGGGAAGGT 


CAGGCGCAGC 


TCGTCACAGG 


GGACGCTTCC 


2520 


AGAGGGTGAT 


GGGTGCAAGG 


AGGCTGAGGC 


CTGGCTTGAG 


GTCAGGCAGG AAAGGAGACT 


2580 


TGACCTCCTG 


GCTTGGGGAC 


ACGTAGGGAG 


CACTGCTGCT AGGCGTCATG 


GGTGGTGTGG 


2640 


GATTCCCCGG 


CAGTGGGGAG 


CTGGGGTAAG 


CAGGGATGGA GCGGGAAGGC 


CCACTCAAGC 


2700 


CAGGCTGGCT 


GTAGCTGATG 


CTGCCCCCGT 


TGAANCTGGC 


GCCCTGCCCG 


TTGAACTGCT 


2760 


CTGTGGGCTT 


GTAATGCAGN 


CCTGTGGGGC 


CAGGGGCAGA 


CAGGGACTGG GCCATGCCCT 


2820 


GCTGCAGGGG 


GAGTCTGTGT 


CCTGCGTAGG 


AGGAGGCAGG 


GCCAGGGGGC 


TGCCCGGGAC 


2880 


CAGGAGCATA 


CTGGGCAGTG 


CTGGCTGCAT 


ACTGGCCNNC 


TTGCAGATAT 


TGNTGNCNAG 


2940 


GATACACCTC 


TGAGTAGGTC 


TCTCTTGACC 


CNCTGTCGAG 


GCAGCGGCTG 


CCCCTGGGGA 


3000 


GGCCCAGGGT 


AGCCATGCTG 


AGGCAGTCGC 


TGTCCTGCAT 


ACAAGGGTGT 


CATGCCTGCT 


3060 


GCCCGTGTAG 


GATTCATGGC 


CATGGAGGAA 


AGGCCTGAGG 


GGCCCATCAT 


TCCTCCCATG 


3120 


CCTGAAGGGT 


TCATGCCAGG 


GGGTACACTG 


GGCCCTCGAG 


GACCTCCATG 


CTGCAGAAAC 


3180 


TGGCTGTTAA 


AAGATTGCCC 


AGTCCCCATC 


GCTCCATATT 


GGCTTAGCTT 


CCTGGCTCTN 


3240 


CTTCTCTTGC 


AGGGNNGCCA 


CAGTAGCTGT 


GGCTGTGGCT 


GTGGCCGTNC 


TGCAGCAGCA 


3300 


GCCATGGCAG 


CTGCAGCTCT 


GCTTGTGTGA 


AGTCAG 






3336 



-165- 



SUBSTITUTE SHEET (RULE 26) 



WO 98/53071 



PCT/US98/10547 



(2) INFORMATION FOR SEQ ID NO: 76: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1494 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 



(vii) IMMEDIATE SOURCE: 
(B) CLONE: HW088 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 9. .686 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION: complement (612.. 909) 

(D) OTHER INFORMATION: /label= SAC_24336 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 76: 

GTCGACCC GCT GCG GTC TAT CTG GGC AGC ACC ATG CTC GCG CTC CGA AGC 50 
Ala Ala Val Tyr Leu Gly Ser Thr Met Leu Ala Leu Arg Ser 
15 10 

GGC CTG AGG ACG GCG CTG GCC CCG CGG GCT CTG ACG CCT CAG GTA TGT 98 
Gly Leu Arg Thr Ala Leu Ala Pro Arg Ala Leu Thr Pro Gin Val Cys 
15 20 25 30 

TCA CCT TTT GCT AC A GGC CCC AGG CAA AGC GAT GGG ACA TTG TAT GAA 146 
Ser Pro Phe Ala Thr Gly Pro Arg Gin Ser Asp Gly Thr Leu Tyr Glu 
35 40 45 

TTC CGC ACC TAC TCT CTG AAG CCC TCA AAG ACA AAT GCG TTC CTG CAG 194 
Phe Arg Thr Tyr Ser Leu Lys Pro Ser Lys Thr Asn Ala Phe Leu Gin 
50 55 60 

AAT TTT CAG AAA TAC GTT CAC CTT CGG ACA GCT CAC TCT GAG ATG ATT 242 
Asn Phe Gin Lys Tyr Val His Leu Arg Thr Ala His Ser Glu Met lie 
65 70 75 

GGA TAC TGG ACC GTA GAA TTT GGA GGC AAA ATA AAC AGA GTG TTC CAT 290 
Gly Tyr Trp Thr Val Glu Phe Gly Gly Lys He Asn Arg Val Phe His 
80 85 90 

ATT TGG AAG TAT GAT AAT TTT GCT CAT CGA ACT GCC GTC CGC AAA GCC 338 
He Trp Lys Tyr Asp Asn Phe Ala His Arg Thr Ala Val Arg Lys Ala 
95 100 105 110 

TTG GCC AAG GAT AAG GAA TGG CAA GAA CAA TTC CTC ATT TCA AAT TTG 386 
Leu Ala Lys Asp Lys Glu Trp Gin Glu Gin Phe Leu He Ser Asn Leu 
115 120 125 

GCT TTC ATG GAT GAA CAA GAG GTT GAG ATT ACC TAC TTG GTA CCC TGG 434 
Ala Phe Met Asp Glu Gin Glu Val Glu He Thr Tyr Leu Val Pro Trp 
130 135 140 
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TGT AAA ATA AGA ACA CCT CCC AAG GAA GGA GTC TAT GAA CTG GCT ACC 482 
Cys Lys lie Arg Thr Pro Pro Lys Glu Gly Val Tyr Glu Leu Ala Thr 
145 150 155 

TTT CAG ATG AAG CCT GGT GGC CCA GCT CTA TGG GGT GAT GCA TTC AAA 530 
Phe Gin Met Lys Pro Gly Gly Pro Ala Leu Trp Gly Asp Ala Phe Lys 
160 165 170 

AGG GCA ATA AAC GCC CAT GTT GAG CTT GGC TAC TCT ACA CTA GTT GGT 578 
Arg Ala lie Asn Ala His Val Glu Leu Gly Tyr Ser Thr Leu Val Gly 
175 180 185 190 

GTT TTC CAC ACT GAA TAT GGA GCC CTC AAC AGA GTT CGG GAA AGT GTC 626 
Val Phe His Thr Glu Tyr Gly Ala Leu Asn Arg Val Arg Glu Ser Val 
195 200 205 

AAT TAC CTC GAG TCT CAG CAG AAT ATG TTC CTG ATT CCA ACC TCA TTT 674 
Asn Tyr Leu Glu Ser Gin Gin Asn Met Phe Leu lie Pro Thr Ser Phe 
210 215 220 

TCA CCA TTG AAG TAGTTTCCTA CCGAGCGCTG AGCATTCCCA TCACGGAGGT 726 
Ser Pro Leu Lys 
225 



GAGATGCGTT 


TCCCCCAGAC 


AATCGCCCTG 


TTCATTGTCT 


GAAGGAGGTG 


CTGAGCTACC 


786 


GCACTGCCCT 


CTGCATGTCT 


GAGACTTCAT 


GTATCCTTTA 


CTACCAGTCG 


CTCAGGAAAT 


846 


GTTTCTGTCC 


CTTTTCATGT 


CTACTGGATG 


TCAGAAAGTG 


GTGTTATTAC 


TCCATGCCCT 


906 


GGTACTTCAT 




TGGTTTTATG 


ATTAAGTCTT 


GTCCTCAAAA 


TATGATTTTG 


966 


AAATTTTTTC 


CCTCAAAATA 


TTTGGAAATT 


AAATAAATAC 


AATATCTGCC 


TTTTTATTGC 


1026 


GTTCATTTTC 


CCTGAAATTT 


TTAACTGTTT 


TTACCAGTAA 


TGTTTACTTT 


GTGATTTCAA 


1086 


ATGGGATTGA 


TAGAAGTCAG 


AGAAAGAACT 


GGAAGAGCTT 


GAAGGGGCTC 


GAGACCCCAT 


1146 


ATGTACAACA 


ATGCCAAGCA 


ACCAGAGCTT 


CCAGGGACTA 


AGCCACTACC 


TAAAGACTAT 


1206 


ACATGGACTG 


ACCCTGGACT 


CTGACCTCAT 


AGGTAGCAAT 


GAATATCCTA 


GTAAGAGCAC 


1266 


CAGTGGAAGG 


GGAAGCCCTG 


GGTCCTGCTA 


AGACTGAACC 


CCAGTGAACG 


TGATTGTTGT 


1326 


GGGGAGAGTG 


GTAATGGGGG 


GAGGATGGGG 


AGGGGAAGCC 


CATATAGAAG 


GGGAGGGAGA 


1386 


GGGGTTAGGG 


GGATGTCGGC 


CCAGAAACCA 


GGAAGGGGAA 


TAACAATTGA 


AATGTAAATA 


1446 


AGAAATACTC 


AAGTTAATAA 


AGATGGAGAA 


AAGAAAAAGA 


AAGTCGAC 




1494 



(2) INFORMATION FOR SEQ ID NO: 77: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 226 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 77: 
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Ala Ala Val Tyr Leu Gly Ser Thr Met Leu Ala Leu Arg Ser Gly Leu 
15 10 15 

Arg Thr Ala Leu Ala Pro Arg Ala L u Thr Pro Gin Val Cys Ser Pro 
20 25 30 

Phe Ala Thr Gly Pro Arg Gin Ser Asp Gly Thr Leu Tyr Glu Phe Arg 
35 40 45 

Thr Tyr Ser Leu Lys Pro Ser Lys Thr Asn Ala Phe Leu Gin Asn Phe 
50 55 60 

Gin Lys Tyr Val His Leu Arg Thr Ala His Ser Glu Met lie Gly Tyr 
65 70 75 80 

Trp Thr Val Glu Phe Gly Gly Lys lie Asn Arg Val Phe His lie Trp 
85 90 95 

Lys Tyr Asp Asn Phe Ala His Arg Thr Ala Val Arg Lys Ala Leu Ala 
100 105 110 

Lys Asp Lys Glu Trp Gin Glu Gin Phe Leu lie Ser Asn Leu Ala Phe 
115 120 125 

Met Asp Glu Gin Glu Val Glu lie Thr Tyr Leu Val Pro Trp Cys Lys 
130 135 140 

lie Arg Thr Pro Pro Lys Glu Gly Val Tyr Glu Leu Ala Thr Phe Gin 
145 150 155 160 

Met Lys Pro Gly Gly Pro Ala Leu Trp Gly Asp Ala Phe Lys Arg Ala 
165 170 175 

He Asn Ala His Val Glu Leu Gly Tyr Ser Thr Leu Val Gly Val Phe 
180 185 190 

His Thr Glu Tyr Gly Ala Leu Asn Arg Val Arg Glu Ser Val Asn Tyr 
195 200 205 

Leu Glu Ser Gin Gin Asn Met Phe Leu He Pro Thr Ser Phe Ser Pro 
210 215 220 

Leu Lys 
225 

(2) INFORMATION FOR SEQ ID NO: 78: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1082 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 



(vii) IMMEDIATE SOURCE: 
<B) CLONE: HW089 

(ix) FEATURE: 

(A) NAME/ KEY: CDS 
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(B) LOCATION: 8.. 1006 

(ix) FEATURE: 

(A) NAME/KEY: misc feature 

(B) LOCATION: 961.. 1076 

(D) OTHER INFORMATION: /label= SAC_24461 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 78: 

GTCGACC AAG CAA CAG CAG GGA AGT GAA TTT CAG ACA CTG CTG GAG GCT 49 
Lys Gin Gin Gin Gly Ser Glu Phe Gin Thr Leu Leu Glu Ala 
15 10 

GAG GAG ACC AAA GCC GCC TCT GAC GCT TCA AAT AAA AAG CTT TGG TCC 97 
Glu Glu Thr Lys Ala Ala Ser Asp Ala Ser Asn Lys Lys Leu Trp Ser 
15 20 25 " 30 

CTC AGA ACT CTA CCC TTC CAG CCC TGC TCT GAA GCC TCA GAC GTG AGA 145 
Leu Arg Thr Leu Pro Phe Gin Pro Cys Ser Glu Ala Ser Asp Val Arg 
35 40 45 

ACC ATG TCT GTG ACC CTG CAA GCT GTC CTC TAC AGT CTT CAG GAG GAG 193 
Thr Met Ser Val Thr Leu Gin Ala Val Leu Tyr Ser Leu Gin Glu Glu 
50 55 60 

CAG TCC AGG CTC AAG ACG AGG CTG CAG GAA CTA CAG CAG CTG AAA AGG 241 
Gin Ser Arg Leu Lys Thr Arg Leu Gin Glu Leu Gin Gin Leu Lys Arg 
65 70 75 

GAG CGC ATA GAT GTT CCC AGA GAC AAG ATC CCA TTC CCG GTG CCT GAA 289 
Glu Arg He Asp Val Pro Arg Asp Lys He Pro Phe Pro Val Pro Glu 
80 85 90 

GTC CCC GTG GTA TTC CAA GGT CTA ACT AAG CAG AGC GGC CAA GCA CCC 337 
Val Pro Val Val Phe Gin Gly Leu Thr Lys Gin Ser Gly Gin Ala Pro 
95 100 105 110 

AAG TCT CTA GTT TCT AAC TTG AAG ATC TGC TGC CCT CTG CCT GGA GGT 385 
Lys Ser Leu Val Ser Asn Leu Lys He Cys Cys Pro Leu Pro Gly Gly 
115 120 125 

TCT GCT CTG ATC ACC TTT GAT GAC CCC AAA GTG GTT GAG CGG TTG CTA 433 
Ser Ala Leu He Thr Phe Asp Asp Pro Lys Val Val Glu Arg Leu Leu 
130 135 140 

CAA CAA AAG GAA CAT AAA GTT GAC ATC GAG GAC TGC CGG CTG CGG GTG 481 
Gin Gin Lys Glu His Lys Val Asp He Glu Asp Cys Arg Leu Arg Val 
145 150 155 

CAG ATC CAG CCC TTG GAG CTG CCT ATG GTG ACC AAC ATC CAG GTG TCC 529 
Gin He Gin Pro Leu Glu Leu Pro Met Val Thr Asn He Gin Val Ser 
160 165 170 

AGC CAG CCG GAT AGC CAC AGG GTG CTA GTC AGT GGC TTT CCT ACT GGA 577 
Ser Gin Pro Asp Ser His Arg Val Leu Val Ser Gly Phe Pro Thr Gly 
175 180 185 190 

CTT AGG CTG AGT GAG GAG GAG CTA CTG GAC AAG TTA GAG ATC TTC TTC 625 
Leu Arg Leu Ser Glu Glu Glu Leu Leu Asp Lys Leu Glu He Phe Phe 
195 200 205 

GGC AAG GCC AAG AAT GGA GGT GGG GAT GTG GAG ACC CGG GAG ATG CTG 673 
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Gly Lys Ala Lys Asn Gly Gly Gly Asp Val Glu Thr Arg Glu Met Leu 
210 215 220 

CAA GGG ACT GTC ATG CTG GGA TTT GCT AAT GAA GAA GTG GCC CAG CAC 721 
Gin Gly Thr Val Met Leu Gly Phe Ala Asn Glu Glu Val Ala Gin His 
225 230 235 

TTA TGC CAG ATT GGC CAG TTC AGA GTC CCA CTG GGT CGG CAG CAG GTC 769 
Leu Cys Gin lie Gly Gin Phe Arg Val Pro Leu Gly Arg Gin Gin Val 
240 245 250 

CTT TTG AGG GTC TCT CCC TAT GTG AGT GGG GAG ATC CAG GAA GCT GAG 817 
Leu Leu Arg Val Ser Pro Tyr Val Ser Gly Glu lie Gin Glu Ala Glu 
255 260 265 270 

ATC AAA TTC CAG CAA GGA CCT CAT TCA GTG CTG GTG ACG AAT ATT CCC 865 
lie Lys Phe Gin Gin Ala Pro His Ser Val Leu Val Thr Asn lie Pro 
275 280 285 

GAC GTC CTG GAT GTG CAG GAA CTG CAT GAC ATT CTG GAG ATC CAC TTC 913 
Asp Val Leu Asp Val Gin Glu Leu His Asp lie Leu Glu lie His Phe 
290 295 300 

CAG AAG CCT ACC CGT GGG GGC GGA GAG GTG GAG GCC CTG GCA GTT GTA 961 
Gin Lys Pro Thr Arg Gly Gly Gly Glu Val Glu Ala Leu Ala Val Val 
305 310 315 

CCT GTG GGG CAG CAG GGC CTG GCT GTC TTT ACT TCT GAG TCA AGC 1006 
Pro Val Gly Gin Gin Gly Leu Ala Val Phe Thr Ser Glu Ser Ser 
320 325 330 

TAGGGCTGCA TGTCCACCTG CTGGGCATGG GACTGCTCAT TGCCTAATGC TACCAAGAAC 1066 

AGTGAAAGGC GTCGAC 1082 

(2) INFORMATION FOR SEQ ID NO: 79: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 333 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 79: 

Lys Gin Gin Gin Gly Ser Glu Phe Gin Thr Leu Leu Glu Ala Glu Glu 
15 10 15 

Thr Lys Ala Ala Ser Asp Ala Ser Asn Lys Lys Leu Trp Ser Leu Arg 
20 25 30 

Thr Leu Pro Phe Gin Pro Cys Ser Glu Ala Ser Asp Val Arg Thr Met 
35 40 45 

Ser Val Thr Leu Gin Ala Val Leu Tyr Ser Leu Gin Glu Glu Gin Ser 
50 55 60 

Arg Leu Lys Thr Arg Leu Gin Glu Leu Gin Gin Leu Lys Arg Glu Arg 
65 70 75 80 

lie Asp Val Pro Arg Asp Lys He Pro Phe Pro Val Pro Glu Val Pro 
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85 



90 



95 



Val Val Phe Gin Gly Leu Thr Lys Gin Ser Gly Gin Ala Pro Lys Ser 
100 105 110 

Leu Val Ser Asn Leu Lys lie Cys Cys Pro Leu Pro Gly Gly Ser Ala 
115 120 125 

Leu lie Thr Phe Asp Asp Pro Lys Val Val Glu Arg Leu Leu Gin Gin 
130 135 140 

Lys Glu His Lys Val Asp lie Glu Asp Cys Arg Leu Arg Val Gin lie 
145 150 155 160 

Gin Pro Leu Glu Leu Pro Met Val Thr Asn lie Gin Val Ser Ser Gin 
165 170 175 

Pro Asp Ser His Arg Val Leu Val Ser Gly Phe Pro Thr Gly Leu Arg 
180 185 190 

Leu Ser Glu Glu Glu Leu Leu Asp Lys Leu Glu lie Phe Phe Gly Lys 
195 200 205 

Ala Lys Asn Gly Gly Gly Asp Val Glu Thr Arg Glu Met Leu Gin Gly 
210 215 220 

Thr Val Met Leu Gly Phe Ala Asn Glu Glu Val Ala Gin His Leu Cys 
225 230 235 240 

Gin lie Gly Gin Phe Arg Val Pro Leu Gly Arg Gin Gin Val Leu Leu 
245 250 255 

Arg Val Ser Pro Tyr Val Ser Gly Glu He Gin Glu Ala Glu He Lys 
260 265 270 

Phe Gin Gin Ala Pro His Ser Val Leu Val Thr Asn He Pro Asp Val 
275 280 285 

Leu Asp Val Gin Glu Leu His Asp He Leu Glu He His Phe Gin Lys 
290 295 300 

Pro Thr Arg Gly Gly Gly Glu Val Glu Ala Leu Ala Val Val Pro Val 
305 310 315 320 

Gly Gin Gin Gly Leu Ala Val Phe Thr Ser Glu Ser Ser 



(2) INFORMATION FOR SEQ ID NO: 80: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2263 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 



(Vii) IMMEDIATE SOURCE: 



325 



330 



(B) CLONE: HW090 
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(ix) FEATURE: 

(A) NAME/ KEY: CDS 

(B) LOCATION: 335.. 937 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION: 1355.. 1434 

(D) OTHER INFORMATION: / label = SAC_24197 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 80: 

GTCGACCTTT AAGCTTACCA TCAGAACAAC AAATCAAAAT GTAAACTTAA AATATAACCA 60 

AAAGAGGGAC AGCTCTTTAG GAAAAGGAAA AAACCTTAAA TAGTGAATAA ACAACTACAA 120 

CCACTTAACC ATTGTAGGCT TAAAAGCAGC CATCAATAAA GAAAGCGTTC AAGCTCAACA 180 

TACACACACA CACACACACA CACACACGCT CCCTCCCTCC GGCGCTCTAC TCCGCGCCCG 240 

CCCGCCCGCT CGCCCGCCCG CCCGCCTCGC ACGCACCGGC CTCGGCTCCC CGCGCCCGCC 300 

GCCGAGCGAA GCTGCTCTGG GATCGGAGCA GCTC ATG GAG AAA GTG CCG GGC 352 

Met Glu Lys Val Pro Gly 
1 5 

GAC ATG GAG ATA GAG CGC AGG GAG AGG AAC GAG GAG CTG TCC GAG GCG 400 
Asp Met Glu lie Glu Arg Arg Glu Arg Asn Glu Glu Leu Ser Glu Ala 
10 15 20 

GAG AGG AAG GCG GTT CAG GCT ACG TGG GCC CGG CTG TAT GCC AAC TGC 448 
Glu Arg Lys Ala Val Gin Ala Thr Trp Ala Arg Leu Tyr Ala Asn Cys 
25 30 35 

GAG GAC GTG GGG GTG GCC ATC CTG GTG AGG TTC TTT GTG AAC TTC CCG 496 
Glu Asp Val Gly Val Ala lie Leu Val Arg Phe Phe Val Asn Phe Pro 
40 45 50 

TCG GCC AAG CAG TAC TTC AGC CAG TTT AAA CAC ATG GAG GAT CCG TTG 544 
Ser Ala Lys Gin Tyr Phe Ser Gin Phe Lys His Met Glu Asp Pro Leu 
55 60 65 70 

GAG ATG GAG AGG AGT CCT CAG CTG CGG AAA CAT GCC TGC CGG GTC ATG 592 
Glu Met Glu Arg Ser Pro Gin Leu Arg Lys His Ala Cys Arg Val Met 
75 80 85 

GGG GCC CTC AAC ACG GTC GTG GAG AAC CTG CAT GAC CCA GAC AAG GTA 640 
Gly Ala Leu Asn Thr Val Val Glu Asn Leu His Asp Pro Asp Lys Val 
90 95 100 

TCC TCT GTG CTC GCG CTG GTT GGC AAG GCC CAC GCC CTC AAG CAC AAG 688 
Ser Ser Val Leu Ala Leu Val Gly Lys Ala His Ala Leu Lys His Lys 
105 110 115 

GTG GAA CCT ATG TAC TTT AAG ATT CTC TCT GGG GTC ATT CTG GAC GTG 736 
Val Glu Pro Met Tyr Phe Lys lie Leu Ser Gly Val lie Leu Asp Val 
120 125 130 

ATC GCC GAG GAG TTT GCC AAT GAC TTC CCC GTC GAG ACG CAG AAA GCC 784 
lie Ala Glu Glu Phe Ala Asn Asp Phe Pro Val Glu Thr Gin Lys Ala 
135 140 145 150 

TGG ACC AAG CTG CGC GGT CTC ATC TAC AGC CAT GTG ACC GCG GCC TAC 832 
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Trp Thr Lys Leu Arg Gly Leu lie Tyr Ser His Val Thr Ala Ala Tyr 
155 160 165 

AAG GAA GTG GGC TGG GTA CAG CAG CCT ACC AGC CAC ACT GCC CTC TTC 880 
Lys Glu Val Gly Trp Val Gin Gin Pro Thr Ser His Thr Ala Leu Phe 
170 175 180 

AGG GCC ATA GGA CCC CTC CCA GCT CCC CCC TCC CTG GCA GCA CTT TGG 928 
Arg Ala lie Gly Pro Leu Pro Ala Pro Pro Ser Leu Ala Ala Leu Trp 
185 190 195 



GCA GAA GGC TGACTTCTGA AGACACTCCT TGACCTTCCA TTTCTGGGTG 977 
Ala Glu Gly 
200 



CCAAGGAAGC 


TGGAGGAACC 


CCTGACTGGA 


CTTCCCGAAA 


GGAGGCCTTC ATCATGGCTG 


1037 


TGGTCTCCCA 


CGTTACTGGG 


AGGTGGGGCT 


GGCTACCTGG 


ATGCTGAATT 


TAGCAGAGGG 


1097 


GAACCCTTCT 


GCCTCGCAGC 


AGAGGGCACA 


CCCTCCTTAG 


CTTTCACTGT 


CAAGAGAGGG 


1157 


ACCTGGCATG 


TGGCCGGTGG 


GAAGCAAGGA 


CAAGCCTTTG 


AGTCACTTCA 


GAAAGGAACT 


1217 


AGCCCTGCTT 


CACTCTGCCC 


TACCAGCATG 


CAGGTTCTCC 


TCACCATCTC 


TGAGCATTGC 


1277 


GCACACACAC 


GTCTACCATA 


TACACAGATA 


TATTCTATAT 


ACGAGCTCTA 


TATAAATATA 


1337 


TATACACACA 


CACACGTACA 


TATCTGTCGC 


GTGCACCTGC 


AGGCCCAGAA 


GCCTAGGCCT 


1397 


CGCACAGCTG 


TGGGAGACCC 


GGAAGAGTCC 


TGGCCGTACT 


TCAGGGCAGC 


TCTACCGGGA 


1457 


GCAGGGAGAA 


AGCTTGCAAC 


AGCACGAGAG 


GGCGGGGCCG 


GGTGGGAGAG 


TCGGGGTGAC 


1517 


TCAGCTCACC 


CCGCACAGCT 


CAGTGCCCTG 


CAATCTTCCC 


CTTCCGCCCC 


CCAAAAGAGA 


1577 


ACACTTGGTT 


TGCCCGGTGT 


CGGGGCGCAC 


ACCTGTAATC 


CCCGAGTTTG 


GGACGCAGAG 


1637 


GAAGGACAAT 


CTTTGCAAGT 


CTGAGGTCAG 


GTGGTCTACA 


CTTGGTCTCC 


ATCTTGGTGA 


1697 


GTTTCCGGCT 


AGCTGGGGTC 


GACTCAGGGA 


GACCCCGTGT 


CAAACAAAAC 


AAGAAGCGAA 


1757 


GAGAAGATCT 


GGCTCCCCTC 


CCCTCCCCAA 


GGGTTTGCCC 


GACTGACTTG 


CTCCGGAAAG 


1817 


GGGCACACCG 


AGAATTCAGG 


AGACCCCAGT 


GGGGTGCTGG 


CCAGTGTAGG 


TGTCCGGCGT 


1877 


TTGGGGGTGG 


GGTGAGGGGC 


AGCTTCAGCG 


TGGAGCCCTA 


GAATGTCAGC 


CTCACCCGCT 


1937 


CACTTCAGAC 


AGCAGCCAGG 


CCACAGGACC 


AGGGGCCATA 


CTCAAACCCC 


TCAGCAGAGA 


1997 


CCAGGGCGCC 


TCTCTCCTGC 


CCTCCTCGGG 


GCCACCTGCC 


CATCCTCTCC 


AACTAGTCTC 


2057 


CGTAACATTC 


GCTGGGGACC 


CCGTGTGTAC 


CCCTTGGTCA 


CGTGTACTGT 


CCTGGAGGAC 


2117 


GTCTTACTGC 


TGTGGCTACC 


GTGCCTGCGT 


CCCCTCTGGT 


CCCACCCTCA 


CGCAGCCTTC 


2177 


CTGTAACTGC 


CTGTCTTTTT 


GTAGTTTCTG 


ATGTTTGTAA 


CCAGACCCAG 


CTGTGTCATT 


2237 


AAACAGGTCT 


GTTCTTCCTG 


TAAAAA 








2263 



(2) INFORMATION FOR SEQ ID NO: 81: 

(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 201 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 81: 

Met Glu Lys Val Pro Gly Asp Met Glu He Glu Arg Arg Glu Arg Asn 
15 10 15 

Glu Glu Leu Ser Glu Ala Glu Arg Lys Ala Val Gin Ala Thr Trp Ala 
20 25 30 

Arg Leu Tyr Ala Asn Cys Glu Asp Val Gly Val Ala He Leu Val Arg 
35 40 45 

Phe Phe Val Asn Phe Pro Ser Ala Lys Gin Tyr Phe Ser Gin Phe Lys 
50 55 60 

His Met Glu Asp Pro Leu Glu Met Glu Arg Ser Pro Gin Leu Arg Lys 
65 70 75 80 

His Ala Cys Arg Val Met Gly Ala Leu Asn Thr Val Val Glu Asn Leu 
85 90 95 

His Asp Pro Asp Lys Val Ser Ser Val Leu Ala Leu Val Gly Lys Ala 
100 105 110 

His Ala Leu Lys His Lys Val Glu Pro Met Tyr Phe Lys He Leu Ser 
115 120 125 

Gly Val He Leu Asp Val He Ala Glu Glu Phe Ala Asn Asp Phe Pro 
130 135 140 

Val Glu Thr Gin Lys Ala Trp Thr Lys Leu Arg Gly Leu He Tyr Ser 
145 150 155 160 

His Val Thr Ala Ala Tyr Lys Glu Val Gly Trp Val Gin Gin Pro Thr 
165 170 175 

Ser His Thr Ala Leu Phe Arg Ala He Gly Pro Leu Pro Ala Pro Pro 
180 185 190 

Ser Leu Ala Ala Leu Trp Ala Glu Gly 
195 200 

(2) INFORMATION FOR SEQ ID NO: 82: 

U) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1740 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 

(vii) IMMEDIATE SOURCE: 
(B) CLONE: HW092 
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(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION: complement (303.. 566) 

(D) OTHER INFORMATION: /label= SAC_24320 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 82: 



GTCGACCTCG 


GAGCTGGGGA 


CCGAACCCAG 


GGCCTTGCGC 


TTGCTAGGCA 


AGCGCTCTAC 


60 


CACTGAGCTA 


AATCCCCAAC 


CCCTAGCCCT 


TTGTTTTTTA 


AGAAAGTCTA 


GTGTAGCCCA 


120 


GGTTGGCCTT 


AAATTCCTGA 


TCCTCCTGCC 


CCTATCTGTA 


ATCTTGGGAT 


TATAGGTGTG 


180 


GGCCATCACA 


CTGTTTATCC 


AGTGCTGAAT 


CAAACCCAGA 


ATCTTCCACA 


TGCTAGGCGA 


240 


GCAGTCTCCT 


AAATACTACA 


CGGCTATAGG 


TTCAGCCTAG 


ATCTCTCTTT 


TGAAGCTCTC 


300 


GTACACCTAT 


AGGGGCCACT 


AATCCTAAAG 


ACGCAGAGGA 


TGAAAACCAC 


CAGGCCCAAG 


360 


GGTTAACGTG 


GAAAACAGAG 


CCCTGGTGTC 


CCCCTGGGGA 


ACCAGCCCTT 


CATGTTCTAA 


420 


GACAGTCATC 


ACCCAGATTC 


CTGAAAACAA 


TAGCGGCTTC 


TTGTTTAGAA 


CGGAACCGCA 


480 


CCTGCTGGAA 


TACGTGTCAA 


CAAACAATCG 


CTTTGTTTTT 


CATATTCGCC 


ATCCAAGCTG 


540 


AGGCAATGGG 


ACTTCCTGAA 


GCGAGTACTG 


GAAGACTTAT 


TCTTAGAAGA 


GGACAGAATT 


600 


CTCATTTTAA 


CACGAGGCTC 


AGGGGCCGAG 


TGCTGGCCAT 


CTTCCTTCCT 


TCTTTTGTAC 


660 


TCCCTGGTGT 


TCATCTTATT 


TGGTTTTCAA 


CAGGTTTATT 


TCCATTGCAG 


CCCGGGGCTT 


720 


CCAACGCCCA 


AAAACGCCCA 


GCCCACGATG 


CCGTAAATGG 


TGAGCACGCT 


GGCTTCTGCT 


780 


TCCTCTCCCT 


GAGAACCCAG 


ACCCTTTGTT 


GTTCCGCCAT 


TCCAAAAGGC 


CCTGGGAAGG 


840 


TTCTTTGCTC 


CTCTGGTGAC 


AGGCTTGGGA 


TGAAGACTTG 


GGAACTCCTA 


TTTGCTAAAC 


900 


TCTCGCTCCT 


GGCTAAATCT 


TTCTGCTTGT 


TCTTTTTCCT 


CGTCTAATCA 


TGCTACCTGC 


960 


TACTACTTTT 


CCTGATGCAG 


TCTGACGAAC 


GTCTGTTGAA 


CACATCGGGA 


CAGTTTTAAA 


1020 


AGGAGTTATT 


TTGGTGGCAA 


AGGCGCTCAG 


TTGTGATTTA 


AAAGTGTTTT 


AAGTGATTCA 


1080 


TCCAAGGTTG 


TCTCCATGGA 


CTCCGGAAAT 


AACCACAGCC 


AAAATACCCC 


GGCGAGAGCA 


1140 


GAATATGAGA 


GGCAGATAGC 


ACCTATTATT 


ACTTTTATCT 


GGGTGCAGCC 


GGGCTCCAGA 


1200 


TTAATGGGGG 


CAAGCAAAGG 


TCCAGGCCTC 


AGACATGGCC 


AATTTTAAGC 


AAACTAGTGC 


1260 


TAGTGTTGAT 


AAGTATTCAA 


TAAGGGCTTG 


GAATCTAATT 


ATTTGTAAGA 


AGTGTCTTTG 


1320 


TTTCCTGATC 


TGTAAAGGAG 


ATGGGGAAAT 


ACTGTGATTT 


CGTGCCATTT 


TAAAAAAAAT 


1380 


GAAAAAGCAA 


CATAAAAAAA 


TGCAAATACA 


GAAGAAAAGA 


TGTGGGGTTT 


GAAGTTGGCT 


1440 


TTAGAGAGCT 


GTGTAACCTT 


GGGGAAGTCA 


CCTCGTCTCT 


CTGTGACTTG 


AATTCCTAAT 


1500 


CAACAATAAT 


GCTGGGGGAC 


AAGGTGCTCT 


CCAAATTCCC 


CTTCAGATCA 


GAAAGTCTGG 


1560 


GATAACTATC 


CCAGATAGAT 


CCAAACTTGT 


AAAGGAATCA 


AAAGGAATAG 


TCTAGTGTGA 


1620 


CAGATAACAC 


GGACTTGAAA 


CTTAGTTCAC 


CTGATATTTC 


TTAATGACTT 


AACACCACTA 


1680 
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TACCTCACCC CAGAGCTAGC CTGTCCCCTG TGTGCCCACC AGTAACCTTG AAATGTCGAC 1740 

(2) INFORMATION FOR SEQ ID NO: 83: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1385 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 

(vii) IMMEDIATE SOURCE: 
(B) CLONE: HW093 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 8.. 1357 

(ix) FEATURE: 

(A) NAME /KEY: misc_feature 

(B) LOCATION: 798.. 1385 

(D) OTHER INFORMATION: /label= SAC_24538 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 83: 

GTCGACT CTG CCT CAG CCG CTG CCG CCG CCA CTG CTG CTG CAG GAT CCG 49 
Leu Pro Gin Pro Leu Pro Pro Pro Leu Leu Leu Gin Asp Pro 
15 10 

CCG CAG CCA CCA GTC TCG CGC GTC GTC GCT CCC TCC TTG GAC AGT AAT 97 
Pro Gin Pro Pro Val Ser Arg Val Val Ala Pro Ser Leu Asp Ser Asn 
15 20 25 30 

TTT ATG AAT AAG CAT CAG AAG CCG GTA CTA ACA GGC CAG CGG TTC AAA 145 
Phe Met Asn Lys His Gin Lys Pro Val Leu Thr Gly Gin Arg Phe Lys 
35 40 45 

ACC CGG AAA AGG GAT GAA AAA GAG AAA TTC GAA CCC ACA GTC TTC AGG 193 
Thr Arg Lys Arg Asp Glu Lys Glu Lys Phe Glu Pro Thr Val Phe Arg 
50 55 60 

GAT ACA CTT GTC CAG GGG CTT AAT GAA GCT GGT GAT GAC CTT GAA GCT 241 
Asp Thr Leu Val Gin Gly Leu Asn Glu Ala Gly Asp Asp Leu Glu Ala 
65 70 75 

GTA GCC AAA TTC TTG GAC TCT ACT GGC TCA CGA CTA GAT TAT CGT CGC 289 
Val Ala Lys Phe Leu Asp Ser Thr Gly Ser Arg Leu Asp Tyr Arg Arg 
80 85 90 

TAT GCA GAC ACA CTC TTT GAT ATC CTG GTG GCT GGC AGC ATG CTT GCC 337 
Tyr Ala Asp Thr Leu Phe Asp lie Leu Val Ala Gly Ser Met Leu Ala 
95 100 105 110 

CCT GGA GGA ACA CGC ATA GAT GAT GGT GAC AAG ACC AAG ATG ACC AAC 385 
Pro Gly Gly Thr Arg He Asp Asp Gly Asp Lys Thr Lys Met Thr Asn 
115 120 125 
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CAC TGT GTG TTT TCA GCA AAT GAA GAT CAT GAA ACC ATC CGA AAC TAT 433 
His Cys Val Phe Ser Ala Asn Glu Asp His Glu Thr lie Arg Asn Tyr 
130 135 140 

GCT CAG GTC TTC AAC AAA CTC ATC AGG AGA TAC AAA TAT TTG GAA AAG 481 
Ala Gin Val Phe Asn Lys Leu lie Arg Arg Tyr Lys Tyr Leu Glu Lys 
145 150 155 

GCA TTT GAA GAT GAA ATG AAA AAG CTT CTC CTC TTC CTT AAA GCA TTT 529 
Ala Phe Glu Asp Glu Met Lys Lys Leu Leu Leu Phe Leu Lys Ala Phe 
160 165 170 

TCT GAA GCA GAG CAG ACA AAG TTG GCA ATG CTG TCT GGG ATC CTG TTG 577 
Ser Glu Ala Glu Gin Thr Lys Leu Ala Met Leu Ser Gly He Leu Leu 
175 180 185 190 

GGC AAT GGA ACC CTC CCG GCC ACC ATC CTT ACC AGT CTC TTC ACC GAC 625 
Gly Asn Gly Thr Leu Pro Ala Thr He Leu Thr Ser Leu Phe Thr Asp 
195 200 205 

AGC TTA GTC AAA GAA GGC ATC GCA GCC TCG TTC GCT GTC AAG CTT TTC 673 
Ser Leu Val Lys Glu Gly He Ala Ala Ser Phe Ala Val Lys Leu Phe 
210 215 220 

AAA GCC TGG ATG GCA GAG AAA GAT GCC AAT TCT GTC ACC TCT TCA TTG 721 
Lys Ala Trp Met Ala Glu Lys Asp Ala Asn Ser Val Thr Ser Ser Leu 
225 230 235 

AGA AAA GCC AAC CTG GAC AAG AGG CTG CTT GAA CTC TTC CCA GTG AAC 769 
Arg Lys Ala Asn Leu Asp Lys Arg Leu Leu Glu Leu Phe Pro Val Asn 
240 245 250 

AGA CAG AGT GTG GAT CAT TTC GCC AAG TAC TTC ACT GAC GCA GGA CTG 817 
Arg Gin Ser Val Asp His Phe Ala Lys Tyr Phe Thr Asp Ala Gly Leu 
255 260 265 270 

AAG GAG CTG TCA GAC TTT CTC CGA GTC CAG CAG TCA CTG GGC ACC AGG 865 
Lys Glu Leu Ser Asp Phe Leu Arg Val Gin Gin Ser Leu Gly Thr Arg 
275 280 285 

AAG GAA CTG CAG AAG GAG CTG CAA GAG CGA CTG TCT CAG GAA TGC CCC 913 
Lys Glu Leu Gin Lys Glu Leu Gin Glu Arg Leu Ser Gin Glu Cys Pro 
290 295 300 

ATC AAG GAG GTG GTG CTT TAC GTC AAA GAG GAA ATG AAA AGG AAC GAC 961 
He Lys Glu Val Val Leu Tyr Val Lys Glu Glu Met Lys Arg Asn Asp 
305 310 315 

CTT CCG GAG ACA GCT GTG ATC GGG CTG CTG TGG ACC TGC GTC ATG AAC 1009 
Leu Pro Glu Thr Ala Val He Gly Leu Leu Trp Thr Cys Val Met Asn 
320 325 330 

GCT GTG GAA TGG AAC AAG AAG GAG GAG CTG GTT GCT GAG CAG GCT CTT 1057 
Ala Val Glu Trp Asn Lys Lys Glu Glu Leu Val Ala Glu Gin Ala Leu 
335 340 345 350 

AAG CAC CTG AAG CAA TAC GCT CCC CTG CTG GCC GTG TTC AGT TCC CAA 1105 
Lys His Leu Lys Gin Tyr Ala Pro Leu Leu Ala Val Phe Ser Ser Gin 
355 360 365 

GGC CAG TCA GAG CTT GTC CTC CTG CAG AAG GTT CAG GAA TAC TGC TAT 1153 
Gly Gin Ser Glu Leu Val Leu Leu Gin Lys Val Gin Glu Tyr Cys Tyr 
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370 375 380 

GAC AAC ATC CAC TTC ATG AAA GCC TTC CAG AAG ATC GTG GTT CTC TTT 1201 
Asp Asn lie His Phe Met Lys Ala Phe Gin Lys He Val Val Leu Phe 
385 390 395 

TAT AAA GCC GAC GTT CTG AGC GAG GAG GCG ATA CTG AAG TGG TAT AAA 1249 
Tyr Lys Ala Asp Val Leu Ser Glu Glu Ala He Leu Lys Trp Tyr Lys 
400 405 410 

GAA GCA CAT GCC GCC AAA GGC AAA AGC GTC TTC CTT GAC CAG ATG AAG 1297 
Glu Ala His Ala Ala Lys Gly Lys Ser Val Phe Leu Asp Gin Met Lys 
415 420 425 430 

AAA TTT GTG GAG TGG TTA CAA AAT GCA GAA GAA GAA TCT GAA TCA GAA 1345 
Lys Phe Val Glu Trp Leu Gin Asn Ala Glu Glu Glu Ser Glu Ser Glu 
435 440 445 

GGT GAA GAA AGC TAAATGTCTC CAGCACACTG CCTAAAAA 1385 
Gly Glu Glu Ser 
450 

(2) INFORMATION FOR SEQ ID NO: 84: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 450 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 84: 

Leu Pro Gin Pro Leu Pro Pro Pro Leu Leu Leu Gin Asp Pro Pro Gin 
15 10 15 

Pro Pro Val Ser Arg Val Val Ala Pro Ser Leu Asp Ser Asn Phe Met 
20 25 30 

Asn Lys His Gin Lys Pro Val Leu Thr Gly Gin Arg Phe Lys Thr Arg 
35 40 45 

Lys Arg Asp Glu Lys Glu Lys Phe Glu Pro Thr Val Phe Arg Asp Thr 
50 55 60 

Leu Val Gin Gly Leu Asn Glu Ala Gly Asp Asp Leu Glu Ala Val Ala 
65 70 75 80 

Lys Phe Leu Asp Ser Thr Gly Ser Arg Leu Asp Tyr Arg Arg Tyr Ala 
85 90 95 

Asp Thr Leu Phe Asp He Leu Val Ala Gly Ser Met Leu Ala Pro Gly 
100 105 HO 

Gly Thr Arg He Asp Asp Gly Asp Lys Thr Lys Met Thr Asn His Cys 
115 120 125 

Val Phe Ser Ala Asn Glu Asp His Glu Thr He Arg Asn Tyr Ala Gin 
130 135 140 

Val Phe Asn Lys Leu He Arg Arg Tyr Lys Tyr Leu Glu Lys Ala Phe 
145 150 155 160 
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Glu Asp Glu Met Lys Lys Leu Leu Leu Phe Leu Lys Ala Phe Ser Glu 
165 170 175 

Ala Glu Gin Thr Lys Leu Ala Met Leu Ser Gly lie Leu Leu Gly Asn 
180 185 190 

Gly Thr Leu Pro Ala Thr lie Leu Thr Ser Leu Phe Thr Asp Ser Leu 
195 200 205 

Val Lys Glu Gly lie Ala Ala Ser Phe Ala Val Lys Leu Phe Lys Ala 
210 215 220 

Trp Met Ala Glu Lys Asp Ala Asn Ser Val Thr Ser Ser Leu Arg Lys 
225 230 235 240 

Ala Asn Leu Asp Lys Arg Leu Leu Glu Leu Phe Pro Val Asn Arg Gin 
245 250 255 

Ser Val Asp His Phe Ala Lys Tyr Phe Thr Asp Ala Gly Leu Lys Glu 
260 265 270 

Leu Ser Asp Phe Leu Arg Val Gin Gin Ser Leu Gly Thr Arg Lys Glu 
275 280 285 

Leu Gin Lys Glu Leu Gin Glu Arg Leu Ser Gin Glu Cys Pro lie Lys 
290 295 300 

Glu Val Val Leu Tyr Val Lys Glu Glu Met Lys Arg Asn Asp Leu Pro 
305 310 315 320 

Glu Thr Ala Val He Gly Leu Leu Trp Thr Cys Val Met Asn Ala Val 
325 330 335 

Glu Trp Asn Lys Lys Glu Glu Leu Val Ala Glu Gin Ala Leu Lys His 
340 345 350 

Leu Lys Gin Tyr Ala Pro Leu Leu Ala Val Phe Ser Ser Gin Gly Gin 
355 360 365 

Ser Glu Leu Val Leu Leu Gin Lys Val Gin Glu Tyr Cys Tyr Asp Asn 
370 375 380 

He His Phe Met Lys Ala Phe Gin Lys He Val Val Leu Phe Tyr Lys 
385 390 395 400 

Ala Asp Val Leu Ser Glu Glu Ala He Leu Lys Trp Tyr Lys Glu Ala 
405 410 415 

His Ala Ala Lys Gly Lys Ser Val Phe Leu Asp Gin Met Lys Lys Phe 
420 425 430 

Val Glu Trp Leu Gin Asn Ala Glu Glu Glu Ser Glu Ser Glu Gly Glu 
435 440 445 

Glu Ser 
450 

(2) INFORMATION FOR SEQ ID NO: 85: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1463 base pairs 

(B) TYPE: nucleic acid 



179- 



SUBSTITUTE SHEET (RULE 26) 



WO 98/53071 



PCT/US98/10547 



<C) STRANDEDNESS : double 
(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL : NO 



(vii) IMMEDIATE SOURCE: 
(B) CLONE: HW094 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 255.. 1238 

(ix) FEATURE: 

(A) NAME/ KEY: misc_f eature 

(B) LOCATION: 1050.. 1355 

(D) OTHER INFORMATION: /label= SAC_23896 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 85: 

GTCGACCCCG AACACCCACA CTCGGTCCCC AGCCGTTTTT CTCCCAGAGA TCCGGAAGCC 60 

CCGACGTCGT GCCCTACACC AAAGATCGAG AAAACCACTC AGACGCGGCT CCTAGGCGGC 120 

AGTTCGGTTC CCATAGCAAC CCTCGGAGCA CCCGCATCCC GCTTCCGGCC GGGTGGCTCG 180 

GGCGCTCCCG TGACGTCACG GCGCCGCTTT CCGGTGACTG AGTCGAGCTG AGGTTGTTGT 240 

GGGCCGGGGG CGCC ATG GGG GCG ACT GGC GAC ACC GAG CAG CCG CGG GGC 290 
Met Gly Ala Thr Gly Asp Thr Glu Gin Pro Arg Gly 
1 5 10 

CCC GGC GGG GCG GAG CGA GGC GGC CTG GAG CTG GGC GAC GCG GGC GCG 338 
Pro Gly Gly Ala Glu Arg Gly Gly Leu Glu Leu Gly Asp Ala Gly Ala 
15 20 25 

GCG GGC CAG CCG GTT CTC ACG AAC CCT TGG AAC ATA ATG ATC AAA CAC 386 
Ala Gly Gin Pro Val Leu Thr Asn Pro Trp Asn He Met He Lys His 
30 35 40 

CGG CAG GTG CAG CGA AGA GGC CGC CGA TCT CAG ATG ACC AC A AGT TTC 434 
Arg Gin Val Gin Arg Arg Gly Arg Arg Ser Gin Met Thr Thr Ser Phe 
45 50 55 60 

ACA GAT CCA GCC ATC TCT ATG GAT CTC CTC CGT GCT GTC CTG CAG CCT 482 
Thr Asp Pro Ala He Ser Met Asp Leu Leu Arg Ala Val Leu Gin Pro 
65 70 75 

AGC ATC AAT GAG GAG ATC CAG AGT GTC TTC AAC AAG TAC ATG AAG TTC 530 
Ser He Asn Glu Glu He Gin Ser Val Phe Asn Lys Tyr Met Lys Phe 
80 85 90 

TTC CAG AAG GCA GCG CTG AAT GTG CGA GAC AAC GTT GGT GAA GAG GTG 578 
Phe Gin Lys Ala Ala Leu Asn Val Arg Asp Asn Val Gly Glu Glu Val 
95 100 105 

GAT GCA GAA CAG TTG ATT CAG GAG GCC TGC CGC AGC TGC CTG GAG CAG 626 
Asp Ala Glu Gin Leu He Gin Glu Ala Cys Arg Ser Cys Leu Glu Gin 
110 115 120 
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GCA AAG CTA CTC TTT TCA GAT GGA GAA AAA GTG ATA CCC AGA TTG GCC 674 
Ala Lys Leu Leu Phe Ser Asp Gly Glu Lys Val lie Pro Arg Leu Ala 
125 130 135 140 

CAT GAA CTT CCA GGG ATC AAG CGT GGC CGG CAG GCA GAA GAG GAG TCC 722 
His Glu Leu Pro Gly lie Lys Arg Gly Arg Gin Ala Glu Glu Glu Ser 
145 150 155 

CAC CGA GGA AGC CCC ATT CCC AAA AAG AGG AAA GGT CGG CCT CCT GGA 770 
His Arg Gly Ser Pro lie Pro Lys Lys Arg Lys Gly Arg Pro Pro Gly 
160 165 170 

CAT GTC CTG TCA AAT GAC CGC GCA GCT GCT GGC ATG GTA TGG AAA CCA 818 
His Val Leu Ser Asn Asp Arg Ala Ala Ala Gly Met Val Trp Lys Pro 
175 180 185 

AAA TCC TGT GAA CCA ATT CGC CGA GAA GGC CCC AAG TGG GAC CCA GCT 866 
Lys Ser Cys Glu Pro lie Arg Arg Glu Gly Pro Lys Trp Asp Pro Ala 
190 195 200 

CGG CTG AAT GAA TCT ACC ACC TTT GTT TTG GGG TCT CGA GCC AAC AAG 914 
Arg Leu Asn Glu Ser Thr Thr Phe Val Leu Gly Ser Arg Ala Asn Lys 
205 210 215 220 

GCC TTA GGG ATG GGA GGC ACC AGA GGG AGG ATT TAC ATC AAG CAC CCA 962 
Ala Leu Gly Met Gly Gly Thr Arg Gly Arg He Tyr He Lys His Pro 
225 230 235 

CAC CTC TTT AAG TAT GCA GCA GAT CCT CAG GAC AAG CAC TGG CTG GCT 1010 
His Leu Phe Lys Tyr Ala Ala Asp Pro Gin Asp Lys His Trp Leu Ala 
240 245 250 

GAG CAG CAT CAT ATG CGG GCA ACA GGA GGA AAG ATG GCG TAC CTT CTC 1058 
Glu Gin His His Met Arg Ala Thr Gly Gly Lys Met Ala Tyr Leu Leu 
255 260 265 

ATT GAG GAA GAC ATC CGA GAC TTG GCT GCC AGC GAT GAC TAC AGA GGA 1106 
He Glu Glu Asp He Arg Asp Leu Ala Ala Ser Asp Asp Tyr Arg Gly 
270 275 280 

TGC TTG GAC CTG AAG TTG GAG GAG CTG AAG TCC TTT GTT TTG CCA TCC 1154 
Cys Leu Asp Leu Lys Leu Glu Glu Leu Lys Ser Phe Val Leu Pro Ser 
285 290 295 300 

TGG ATG GTT GAG AAG ATG CGG AAA TAC ATG GAG ACA CTA CGG ACA GAA 1202 
Trp Met Val Glu Lys Met Arg Lys Tyr Met Glu Thr Leu Arg Thr Glu 
305 310 315 

AAT GAG CAT CGT GCT GCT GAA GCA ACT CCC CAG ACC TGAGCCGAGT 1248 
Asn Glu His Arg Ala Ala Glu Ala Thr Pro Gin Thr 
320 325 

GTCCTGGCTA CTACACTTGG CAGTCTGCCT CCCAGACCCT CTTTCCCCGC CCGGCTGAGG 1308 

CCATCATGGG GATGCGGTCT AGTTGGCTCT TAGCAGCATC AAGCTGTACA TGAGCTAGTT 1368 

TGTAGTGACT CACTGCAGAG CCCCCCAGAC TGGCTTGTGG TTCTGTTTCT AAAGTTATTG 1428 

GAATAAGAAG CAATTAAACA AGTTTGTAAT AAAAA 1463 



(2) INFORMATION FOR SEQ ID NO: 86: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 328 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 86: 

Met Gly Ala Thr Gly Asp Thr Glu Gin Pro Arg Gly Pro Gly Gly Ala 
1 5 10 15 

Glu Arg Gly Gly Leu Glu Leu Gly Asp Ala Giy Ala Ala Gly Gin Pro 
20 25 30 

Val Leu Thr Asn Pro Trp Asn He Met He Lys His Arg Gin Val Gin 
35 40 45 

Arg Arg Gly Arg Arg Ser Gin Met Thr Thr Ser Phe Thr Asp Pro Ala 
50 55 60 

He Ser Met Asp Leu Leu Arg Ala Val Leu Gin Pro Ser He Asn Glu 
65 70 75 80 

Glu He Gin Ser Val Phe Asn Lys Tyr Met Lys Phe Phe Gin Lys Ala 
85 90 95 

Ala Leu Asn Val Arg Asp Asn Val Gly Glu Glu Val Asp Ala Glu Gin 
100 105 no 

Leu He Gin Glu Ala Cys Arg Ser Cys Leu Glu Gin Ala Lys Leu Leu 
115 120 125 

Phe Ser Asp Gly Glu Lys Val He Pro Arg Leu Ala His Glu Leu Pro 
130 135 140 

Gly He Lys Arg Gly Arg Gin Ala Glu Glu Glu Ser His Arg Gly Ser 
145 150 155 160 

Pro He Pro Lys Lys Arg Lys Gly Arg Pro Pro Gly His Val Leu Ser 
165 170 175 

Asn Asp Arg Ala Ala Ala Gly Met Val Trp Lys Pro Lys Ser Cys Glu 
180 185 190 

Pro He Arg Arg Glu Gly Pro Lys Trp Asp Pro Ala Arg Leu Asn Glu 
195 200 205 

Ser Thr Thr Phe Val Leu Gly Ser Arg Ala Asn Lys Ala Leu Gly Met 
210 215 220 

Gly Gly Thr Arg Gly Arg He Tyr lie Lys His Pro His Leu Phe Lys 
225 230 235 240 

Tyr Ala Ala Asp Pro Gin Asp Lys His Trp Leu Ala Glu Gin His His 
245 250 255 

Met Arg Ala Thr Gly Gly Lys Met Ala Tyr Leu Leu He Glu Glu Asp 
260 265 270 

He Arg Asp Leu Ala Ala Ser Asp Asp Tyr Arg Gly Cys Leu Asp Leu 
275 280 285 
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Lys Leu Glu Glu Leu Lys Ser Phe Val Leu Pro Ser Trp Met Val Glu 
290 295 300 

Lys Met Arg Lys Tyr Met Glu Thr Leu Arg Thr Glu Asn Glu His Arg 
305 310 315 320 

Ala Ala Glu Ala Thr Pro Gin Thr 
325 

(2) INFORMATION FOR SEQ ID NO: 87: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1276 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 



(vii) IMMEDIATE SOURCE: 
(B) CLONE: HW095 

(ix) FEATURE: 

(A) NAME /KEY: CDS 

(B) LOCATION: 7.. 876 

(ix) FEATURE: 

(A) NAME /KEY: misc_feature 

(B) LOCATION: complement (631.. 798) 

(D) OTHER INFORMATION: /label= SAC_23802 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 87: 

GTCGAC ATT TTC TCA GAA TCC TAC GGA GGA AAG ATG GCT GCT GGC ATC 48 
He Phe Ser Glu Ser Tyr Gly Gly Lys Met Ala Ala Gly He 
1 5 10 

AGT TTA GAA CTT CAC AAG GCT ATT CAG CAA GGG ACC ATC AAG TGC AAC 96 
Ser Leu Glu Leu His Lys Ala He Gin Gin Gly Thr He Lys Cys Asn 
15 20 25 30 

TTC TCT GGG GTT GCT TTG GGT GAC TCC TGG ATC TCC CCT GTG GAT TCA 144 
Phe Ser Gly Val Ala Leu Gly Asp Ser Trp He Ser Pro Val Asp Ser 
35 40 45 

GTG CTG TCC TGG GGA CCT TAC CTG TAC AGC GTG TCT CTC CTT GAT AAT 192 
Val Leu Ser Trp Gly Pro Tyr Leu Tyr Ser Val Ser Leu Leu Asp Asn 
50 55 60 

AAA GGC TTG GCT GAG GTG TCC GAC ATT GCG GAG CAA GTC CTC AAT GCT 240 
Lys Gly Leu Ala Glu Val Ser Asp He Ala Glu Gin Val Leu Asn Ala 
65 70 75 

GTA AAC AAG GGC TTC TAC AAG GAA GCC ACT CAG CTG TGG GGG AAA GCA 288 
Val Asn Lys Gly Phe Tyr Lys Glu Ala Thr Gin Leu Trp Gly Lys Ala 
80 85 90 

GAA ATG ATC ATT GAA AAG AAC ACC GAC GGG GTA AAC TTC TAT AAC ATC 336 
Glu Met He He Glu Lys Asn Thr Asp Gly Val Asn Phe Tyr Asn He 
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95 100 105 no 

TTA ACT AAA AGC ACC CCC GAC ACC TCT ATG GAG TCG AGC CTC GAG TTC 384 
Leu Thr Lys Ser Thr Pro Asp Thr Ser Met Glu Ser Ser Leu Glu Phe 
115 120 125 

TTC CGG AGC CCC TTA GTT CGT CTC TGT CAG CGC CAC GTG AGA CAC CTA 432 
Phe Arg Ser Pro Leu Val Arg Leu Cys Gin Arg His Val Arg His Leu 
130 135 140 

CAA GGA GAC GCC TTA AGT CAG CTC ATG AAC GGT CCC ATC AAA AAG AAG 480 
Gin Gly Asp Ala Leu Ser Gin Leu Met Asn Gly Pro He Lys Lys Lys 
145 iso 155 

CTC AAA ATT ATC CCT GAC GAT GTC TCC TGG GGA GCC CAG TCA TCC TCC 528 
Leu Lys He He Pro Asp Asp Val Ser Trp Gly Ala Gin Ser Ser Ser 
160 165 170 

GTC TTC ATA AGC ATG GAA GAG GAC TTC ATG AAG CCT GTC ATC GAC ATC 576 
Val Phe He Ser Met Glu Glu Asp Phe Met Lys Pro Val He Asp He 
175 180 185 190 

^ GTC GAT ACG TTG CTG GAA CTC GGG GTC AAT GTG ACT GTG TAC AAT GGG 624 
Val Asp Thr Leu Leu Glu Leu Gly Val Asn Val Thr Val Tyr Asn Gly 
195 200 205 

CAG CTG GAT CTC ATT GTG GAC ACC ATA GGT CAG GAG TCC TGG GTT CAG 672 
Gin Leu Asp Leu He Val Asp Thr He Gly Gin Glu Ser Trp Val Gin 
21° 215 220 

AAG CTG AAG TGG CCA CAG CTG TCC AGA TTC AAT CAG CTA AAA TGG AAG 720 
Lys Leu Lys Trp Pro Gin Leu Ser Arg Phe Asn Gin Leu Lys Trp Lys 
225 230 235 

GCC CTG TAC ACC AAT CCT AAG TCT TCA GAA ACA TCT GCG TTT GTC AAG 768 
Ala Leu Tyr Thr Asn Pro Lys Ser Ser Glu Thr Ser Ala Phe Val Lys 
240 245 250 

TCC TAT GAG AAC CTA GCG TTC TAC TGG ATC CTA AAG GCG GGT CAC ATG 816 
Ser Tyr Glu Asn Leu Ala Phe Tyr Trp He Leu Lys Ala Gly His Met 
255 260 265 270 

GTT CCT GCT GAC CAA GGG GAC ATG GCT CTG AAG ATG ATG AGG CTG GTT 864 
Val Pro Ala Asp Gin Gly Asp Met Ala Leu Lys Met Met Arg Leu Val 
275 280 285 

ACT CAG CAG GAG TAGCTGAGCT GAGCTGGCCC TGGAGGCCCT GGAGGCCCTG 916 
Thr Gin Gin Glu 
290 

GAGGCCCTGG AGTAGGGCCC AGGATGCAGG TGCTAATGTC TATCCCCGGC GCTCTTCTTC 976 

CCGACTCTAC CATGGGATGT AACTCCAGGA GCCCCTGCCA TCTCCGGTAC CAAAAGACTG 1036 

TGGCTTCCGT GTCTACTCAG AAATCAGTTC TACTTCGTAA ACAGTGTTTA AAACCAGACT 1096 

CATTTAATCA GAGTGAAGGA TTGCAGTCCA TTGGCTTCTT AGCACAGAAG CAGCTGATAA 1156 

CACAAGTAAA CCCCAGCCCT TGAGAGGTAG AAGCAAGAGG ATCAGAGGTT CAAGCGCATC 1216 

CTCGGCTCCA TCACAAGTTC AAAAGCCGCC TGCACCAAAT GGGAGTCCTT GTCTCAAAAA 1276 
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(2) INFORMATION FOR SEQ ID NO: 88: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 290 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 88: 

He Phe Ser Glu Ser Tyr Gly Gly Lys Met Ala Ala Gly lie Ser Leu 
1 5 10 15 

Glu Leu His Lys Ala He Gin Gin Gly Thr He Lys Cys Asn Phe Ser 
20 25 30 

Gly Val Ala Leu Gly Asp Ser Trp He Ser Pro Val Asp Ser Val Leu 
35 40 45 

Ser Trp Gly Pro Tyr Leu Tyr Ser Val Ser Leu Leu Asp Asn Lys Glv 
50 55 60 

Leu Ala Glu Val Ser Asp He Ala Glu Gin Val Leu Asn Ala Val Asn 
65 70 75 80 

Lys Gly Phe Tyr Lys Glu Ala Thr Gin Leu Trp Gly Lys Ala Glu Met 
85 90 95 

He He Glu Lys Asn Thr Asp Gly Val Asn Phe Tyr Asn He Leu Thr 
100 105 no 

Lys Ser Thr Pro Asp Thr Ser Met Glu Ser Ser Leu Glu Phe Phe Ara 
115 120 125 

Ser Pro Leu Val Arg Leu Cys Gin Arg His Val Arg His Leu Gin Gly 
130 135 140 

Asp Ala Leu Ser Gin Leu Met Asn Gly Pro He Lys Lys Lys Leu Lys 
145 150 155 160 

He He Pro Asp Asp Val Ser Trp Gly Ala Gin Ser Ser Ser Val Phe 
165 170 175 

He Ser Met Glu Glu Asp Phe Met Lys Pro Val He Asp He Val Asd 
180 185 190 

Thr Leu Leu Glu Leu Gly Val Asn Val Thr Val Tyr Asn Gly Gin Leu 
i95 200 205 

Asp Leu He Val Asp Thr He Gly Gin Glu Ser Trp Val Gin Lys Leu 
21° 215 220 

Lys Trp Pro Gin Leu Ser Arg Phe Asn Gin Leu Lys Trp Lys Ala Leu 
225 230 235 240 

Tyr Thr Asn Pro Lys Ser Ser Glu Thr Ser Ala Phe Val Lys Ser Tyr 
245 250 255 

Glu Asn Leu Ala Phe Tyr Trp He Leu Lys Ala Gly His Met Val Pro 
260 265 270 
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Ala Asp Gin Gly Asp Met Ala Leu Lys Met Met Arg Leu Val Thr Gin 
275 280 285 

Gin Glu 
290 

(2) INFORMATION FOR SEQ ID NO: 89: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2421 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 



(vii) IMMEDIATE SOURCE: 
(B) CLONE: HW096 

(ix) FEATURE: 

(A) NAME/ KEY: CDS 

(B) LOCATION: 295.. 1302 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 89: 

GTCGACGTCT GATTTTACAT TCATGCAGTT GGGAGTAAGA GGAAAAAGAT TCTGAAGTGT 60 

GGGGAACTTC ACAAAGAGGG GAACAAAGTC TGTGTCTATG GCTCCAAGGG AGAGAACATC 120 

AGAGACACTC TGAGGAAGGA CGGCAGGTTT TGTTCTTTCA TAGAGAATAA CCATTGGAAA 180 

CTCATTAATG ACCTGGACAC TATCATAGAA AACACTCAGC CATTTGATGA GTTGGAGGGC 240 

AAGCTCTTTC AGGTTGCAGC TGAGCTACCA AAGAACCCTA GGGTAGTCTC TGTC ATG 297 

Met 
1 

CAG AAT TCT GGG TTA GAG AAC AGA GAC TTC CAT AAG CTA GAA GAC TAT 345 
Gin Asn Ser Gly Leu Glu Asn Arg Asp Phe His Lys Leu Glu Asp Tyr 
5 10 15 

ATT GTG AAT GCG AAC CCT ACA TTG AAA GAA GAA GGA GAA AAA CTC AGA 393 
lie Val Asn Ala Asn Pro Thr Leu Lys Glu Glu Gly Glu Lys Leu Arg 
20 25 30 

GCA TAC ATC AAG AAA GAA AGT AAA AAA AGA AAG AAA GGT TCC TTA TTC 441 
Ala Tyr lie Lys Lys Glu Ser Lys Lys Arg Lys Lys Gly Ser Leu Phe 
35 40 45 

AAA GTG CAT AAA GAA CAC TTT GGG AAA ATA ACA AAA AAT TCT ACT CCT 489 
Lys Val His Lys Glu His Phe Gly Lys He Thr Lys Asn Ser Thr Pro 
50 55 60 65 

GTT AAA GTG CAC AAA CAT CTT TCG AAA GTC AGT GAT TCA GTT GGG TTC 537 
Val Lys Val His Lys His Leu Ser Lys Val Ser Asp Ser Val Gly Phe 
70 75 80 

CTA TGG TGG AAC AAC AAT GGA AAG GAG GGC TGT GCC ACC TGC TTT GTT 585 
Leu Trp Trp Asn Asn Asn Gly Lys Glu Gly Cys Ala Thr Cys Phe Val 
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85 90 95 

TTT AAA GGA TTA TAC ATT TTG ACT TGC CGG CAT GTG ATA ACT AGT ATT 633 
Phe Lys Gly Leu Tyr He Leu Thr Cys Arg His Val He Thr Ser He 
100 105 HO 

GTG GGT GAA GGC ATA GAT CCA TGT CAG TGG GCA AGC TTA ATT AGT CAG 681 
Val Gly Glu Gly He Asp Pro Cys Gin Trp Ala Ser Leu He Ser Gin 
115 120 125 

TGT GTA AAG GTG ACC TTT GGT TAT GAA GAG TTC CCA CTA GTA GAA GAC 729 
Cys Val Lys Val Thr Phe Gly Tyr Glu Glu Phe Pro Leu Val Glu Asp 
130 135 140 145 

AAG TTT TTT GAG GTT AAA CCT TGG TTT GAG ATA TCT AAT GAA AGC CTT 777 
Lys Phe Phe Glu Val Lys Pro Trp Phe Glu He Ser Asn Glu Ser Leu 
150 155 160 

GAC TAT GCT GTC CTA GAA CTG AGG GAA AAT GGA CAA GAA GTT CCT GCT 825 
Asp Tyr Ala Val Leu Glu Leu Arg Glu Asn Gly Gin Glu Val Pro Ala 
165 170 175 

GGA CTG TAT AAT GGA ATA GGA CCT GTG CCA CTT GGT GGG TTA ATA TAT 873 
Gly Leu Tyr Asn Gly He Gly Pro Val Pro Leu Gly Gly Leu He Tyr 
180 185 190 

ATC ATT GGC CAT CCT GAT GGA GAA AAG AAG TCT ACT GAT GGC TGT GCA 921 
He He Gly His Pro Asp Gly Glu Lys Lys Ser Thr Asp Gly Cys Ala 
195 200 205 

GTG GTC CCT CAA AGT CAT AGA GGG CAA AAA TGT CAG GAA AAT TTT CAA 969 
Val Val Pro Gin Ser His Arg Gly Gin Lys Cys Gin Glu Asn Phe Gin 
210 215 220 225 

GCA AGA GAG AAA GCA AAC TAC TGG ATT TCT ACA TCT TTT TTT CCT ATA 1017 
Ala Arg Glu Lys Ala Asn Tyr Trp He Ser Thr Ser Phe Phe Pro He 
230 235 240 

TAC ACA CAA AGA AGT TTC CAC GAA GTC TTT CAC AAC CAT GAT GTG GTT 1065 
Tyr Thr Gin Arg Ser Phe His Glu Val Phe His Asn His Asp Val Val 
245 250 255 

ACT TAT GAC ACC ACT TTT TTT GGT GGG TCT TCT GGA TCC CCA GTA TTT 1113 
Thr Tyr Asp Thr Thr Phe Phe Gly Gly Ser Ser Gly Ser Pro Val Phe 
260 265 270 

GAT TCT AAT GGT TCA TTG GTG GCC ATG CAT GCT GCT GGC ATC ACT TGT 1161 
Asp Ser Asn Gly Ser Leu Val Ala Met His Ala Ala Gly He Thr Cys 
275 280 285 

ACA GAC CAG AAT GGA GTT TTT AAT ATC ATT GAG TTT GGT TTT ACT ATG 1209 
Thr Asp Gin Asn Gly Val Phe Asn He He Glu Phe Gly Phe Thr Met 
290 295 300 305 

GAA TCC ATT CTT GCT AAT ATT AAG CAA ACT AAA GAG TGG TAT GAT ACC 1257 
Glu Ser He Leu Ala Asn He Lys Gin Thr Lys Glu Trp Tyr Asp Thr 
310 315 320 

ATT TTT GTA AAT GAT CGG GAT GCA GAA ATG CTG AGC ACA GAC TCC 1302 
He Phe Val Asn Asp Arg Asp Ala Glu Met Leu Ser Thr Asp Ser 
325 330 335 

TGAGGACTGG AAAGGGTGTA TTCTAGTTCA CAACTTTAAG GGAATTGCCT AAGGTGTTTT 1362 
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TCTTCAGATA 


ATAATGTTTT 


ATGAACTTTC 


AAAATGATTA ATTTCACCCA ATAATCATAG 


1422 


TTCTAGAGTG 


TATCTTAACT 


ATCAAGCATT 


TTCATGGTTT ATTAAAACAA ACAATCAAAA 


1482 


ACAAAACTAT 


CAGTTGAACT 


ATTTGTGTAA 


AGTAGCATGA AAAAGAGGAT TAAGTGCTGA 


1542 


TGTGGGATCC 


GAGGGTCAGG 


AGATGCTAAG 


ATGTGAAAAA GTTGAGAGCA TAAACTGGAG 


1602 


TTATTAGCCA 


GAAGATTCAT 


GGAAAAGAAA 


GAAGCCTACC ACGTTCCTTG AAACTTAAAC 


1662 


CAATTTCAAA 


CTCACTCATT 


TGGTTCTACC 


CATATCTTCC TTCTCTGCTG ACCAGAGCTC 


1722 


AGCCAGGACA 


CTGAATCACA 


GCGACTTAAC 


CCTCCCAGGG CCCTGAACAT GGTGGCATTT 


1782 


CTTCCATTTT 


GTCTTTTTAA 


CTTAAGACTT 


AGATCTGAAA TGAACCCACA TGAAGTGGCA 


1842 


TGGAAATCAT 


TGAGCAAGCA 


AGGAGCTGGT 


CTCTGCCACA GAATGCATGT ACCTGAGACT 


1902 


TTCTGGACTT 


GAGTTGTGGA 


GAGCAAACAT 


TGGAATAGGT GGTTTCTACT TGATACGATT 


1962 


CACAATGGAG 


AGTGCTAGGA 


GGATATTGGG 


CACAACCTAG GACAATTCTC AGCAGTGTGA 


2022 


CACTCAAGTC 


TTGGTCCTTT 


GAAGAGAAAG 


AAAACTGCGT TCAAGAAGTA ATCTCATCAA 


2082 


AAACAGAGTC 


AAGGAGAATT 


TATGAACAGA 


AGACCAGAGA TCTTAGATTA TTTGAGAGAA 


2142 


AGTAGGAATA 


GATGAGATGC 


TGAAGGAATG 


TTGCTGGGAT CCAGCGATTG CTTCAATTCT 


2202 


TCCTTTATCA 


TAACTGAAAT 


ATGTCCTGTA 


TGAGAGCTAA CATGATCCTA ATCCACAACT 


2262 


CCCCTTATTT 


TCCAGACATT 


TCCTAGTGCT 


ATTGTGTGAT ATTTTGTGTC CCCTGCCCAA 


2322 


TGTATTTTTC 


CCAAATCAAA 


TACAGGTGAA 


TCAATTAAAA AACAGTGTAA TATCTGTAGT 


2382 


AGGTTTTGAA 


ATATAAGATG 


AAAAAATTAA 


AAAGTCGAC 


2421 



(2) INFORMATION FOR SEQ ID NO: 90: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 336 amino acids 

(B) TYPE: amino acid 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 90: 

Met Gin Asn Ser Gly Leu Glu Asn Arg Asp Phe His Lys Leu Glu Asp 
1 5 10 15 

Tyr lie Val Asn Ala Asn Pro Thr Leu Lys Glu Glu Gly Glu Lys Leu 
20 25 30 

Arg Ala Tyr lie Lys Lys Glu Ser Lys Lys Arg Lys Lys Gly Ser Leu 
35 40 - 45 

Phe Lys Val His Lys Glu His Phe Gly Lys lie Thr Lys Asn Ser Thr 
50 55 60 

Pro Val Lys Val His Lys His Leu Ser Lys Val Ser Asp Ser Val Gly 
65 70 75 80 
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Phe Leu Trp Trp Asn Asn Asn Gly Lys Glu Gly Cys Ala Thr Cys Phe 
85 90 95 

Val Phe Lys Gly Leu Tyr lie Leu Thr Cys Arg His Val lie Thr Ser 
100 105 110 

lie Val Gly Glu Gly He Asp Pro Cys Gin Trp Ala Ser Leu He Ser 
115 120 125 

Gin Cys Val Lys Val Thr Phe Gly Tyr Glu Glu Phe Pro Leu Val Glu 
130 135 140 

Asp Lys Phe Phe Glu Val Lys Pro Trp Phe Glu He Ser Asn Glu Ser 
145 150 155 160 

Leu Asp Tyr Ala Val Leu Glu Leu Arg Glu Asn Gly Gin Glu Val Pro 
165 170 175 

Ala Gly Leu Tyr Asn Gly He Gly Pro Val Pro Leu Gly Gly Leu lie 
180 185 190 

Tyr He He Gly His Pro Asp Gly Glu Lys Lys Ser Thr Asp Gly Cys 
195 200 205 

Ala Val Val Pro Gin Ser His Arg Gly Gin Lys Cys Gin Glu Asn Phe 
210 215 220 

Gin Ala Arg Glu Lys Ala Asn Tyr Trp He Ser Thr Ser Phe Phe Pro 
225 230 235 240 

He Tyr Thr Gin Arg Ser Phe His Glu Val Phe His Asn His Asp Val 
245 250 255 

Val Thr Tyr Asp Thr Thr Phe Phe Gly Gly Ser Ser Gly Ser Pro Val 
260 265 270 

Phe Asp Ser Asn Gly Ser Leu Val Ala Met His Ala Ala Gly He Thr 
275 280 285 

Cys Thr Asp Gin Asn Gly Val Phe Asn He He Glu Phe Gly Phe Thr 
290 295 300 

Met Glu Ser He Leu Ala Asn He Lys Gin Thr Lys Glu Trp Tyr Asp 
305 310 315 320 

Thr He Phe Val Asn Asp Arg Asp Ala Glu Met Leu Ser Thr Asp Ser 
325 330 335 



(2) INFORMATION FOR SEQ ID NO: 91: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 836 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 
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(vii) IMMEDIATE SOURCE: 
(B) CLONE: HW097 

(ix) FEATURE: 

(A) NAME/KEY: misc_f eature 

(B) LOCATION: complement (203.. 553) 

(D) OTHER INFORMATION: /label* SAC_23978 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 91: 



GTCGACTATT 


ACTATAGCCT 


TGTATATACT 


CCCCTCCGGC 


CCTGTTTGTC 


TCAGTCCCAT 


60 


TCCCTCTTGT 


CTCTGAGAAT 


CATCCCCCAG 


CCCCAAGTTC 


TCCCTTCTGT 


CTCCCCCCAC 


120 


CCCCCCTGCC 


TCCCATTCTC 


CTGTCTCTGA 


ATGCCTTTGC 


CTGTATAAAG 


AGTTGGATTC 


180 


TCCCCTGGTG 


TCTGTACTGT 


GTACACACAT 


CCCTTTGAGA AGCACAAGGA 


GATGACACGC 


240 


GCATTGTAAC 


CTTCACACTG 


TCTCGGTGGC 


GACATAAAGG 


AAGCTGTGAA 


TTACAAGCTC 


300 


TGCCTCTTTC 


TGGCCTCTCC 


CTCGGCCCCC 


ATGACCGGGG 


CACTCTCTGC 


CCTCCCCACG 


360 


GCCTTAACAT 


GTTCCTGCTC 


CACCTATCCT 


GGTGCCCTTT 


GTCTGGCTGA 


CTGTAGCCTC 


420 


CCTGAGGGAA 


GGGATTGCTG 


GAGAGGTAGC 


TCCTATCCAG 


GAATGAAGGA 


CACTGGCTGG 


480 


AAACTAAGCC 


AAGAATGTTA 


CAGACAGGGG 


GAGCTGGGGA TTGGTGACTG 


ACCCTTGCAG 


540 


TTGGGCAGTG 


AGTACCCAAG 


GCCAGAAGCT 


GAAGCCCTCA 


TTCATTCAGT 


GGCCTTTCTT 


600 


TCCTCAGGGT 


GCCCAGTGCC 


CTGTCCTCAC 


TTTGATGCCT AGGCCCACTG 


CTTGCATTTT 


660 


CTGGTGAGCC 


AGGGCTTGGT 


CATGTGAGGA 


AAGAGGTAGG 


CCTAGAGCAG 


GAGTTGAAAG 


720 


CCCTGCTGTT 


GTGTGTCCCT 


ACCATACAGG 


GCTGGCTAAA 


AGTCGAGGTA 


GACGCTTATC 


780 


TAGGAAGCGC 


AAGGCCCTGG 


GTTCGGTCCC 


CAGCTCGGAA 


AAAAAAGAAC 


CAAAAA 


836 



(2) INFORMATION FOR SEQ ID NO: 92: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 318 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: SAC23798 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 92: 
GGGGGATCCG CCCGATCCAA GACTGCCCAG GGTGGCTGCT CCATCTCACC CAGAGCAACC 60 
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GGAGCTGGAC CTGGGGGTCG GAGCACTTCG AAGGTGATGG GTTATGGTGC CGCCTGCTGG 



120 



ACATAGTTGT CCAGAGCTCT TCCTGAAAGG TTTGCTACAG GATTCGCCCT TCTCATCCAG 



180 



TCTAAGAAAC TACTTAGTCA ACTCCTAGTG TCTAAGCAGA AGGCTTCAGC TATGGCTTCT 



240 



CCCCTTGGGG AGGAAGAGGA AGAGGTACAA ATTCTTGTTT TGCTGCTCCC AGGGAAAAGA 



300 



TCGGGCTAGA GCGGCCGC 



318 



(2) INFORMATION FOR SEQ ID NO: 93: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 278 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 



(vii) IMMEDIATE SOURCE: 

(B) CLONE: SAC_23800 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 93: 

GGGGGATCCG CCCGATCACA CGGCTGCCTT CTGGTATACA GCAGAGGGGA ATGCCAAGGT 60 

ATAAGTATCC TTCAATTCCT TCTTTGCCAC ATACCAGCAT TTCTTTGCCC TTTTGGCATT 120 

CTACCTCTTC TTTGACAGCA CTGAGACTGT TAAGTGCTGC ATACAAGCTA CTTGTCTCCC 180 

TATCTGTNNG CACATGTTTC ATGTCATGAA GATGATAAGG TCAGGCAAAG AGGCATGAAG 240 

ATTCCTGAAT AGTGCCCAGA TCGGGCTAGA GCGGCCGC 278 
(2) INFORMATION FOR SEQ ID NO: 94: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 393 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 



(vii) IMMEDIATE SOURCE: 

(B) CLONE: SAC_23801 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 94: 
GGGGGATCCG CCCGATCCAG TAGCGACAGC CCTGGCCCCT GACGAGGATG GAAGTGACGT 60 
TGTAGCNATT GTCTTCCTGT AGCTCATAGA TGGTGCTGTN NATGGTAAAG CGGCTTTGTC 120 
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TTTCTTTCTG GACCGCATTG CCTGCCAGGN NGACAACGAA CCACCTGCCC TGGAACCGTT 180 

CGGTCCAGAA GCCTGGCTGC AGGGGCACAC TGATCGATCC AGTAGCGACA GCCCTGGCCC 240 

CTGACGAGGA TGGAAGTGAC GTTGTAGCTA TTGTCTTCCT GTAGCTCATA GATGGTGCTG 300 

TACATGGTAA AGCGGCTTTG TCTTTCTTTC TGGACCGCAT TGCCTGCCAG GCCGACAACG 360 

AACCACCTGC CCTGGAACCG TTCGGTCCAG AAG 393 
(2) INFORMATION FOR SEQ ID NO: 95: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 531 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: SAC_23833 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 95: 

GGGGGATCCG CCCGATCTGT ATCTCGTCCC TTGGCTGGCT GAGCCTCTGG AGGAGGTGTC 60 

CAAAGCCCCG GTGGATTTCT GTCTCAGGGG TCTCTGTGAG ATTGAACTTG AGACCTTCTA 120 

GAATCTCTTC CATGCTGTTG CCCTTTGCTC CCAGGGACAC GACGGCCAAG GCGGCTGAGA 180 

TGCTAAGTGG GGAGAAGACA ACATTTTTAT CTGGATTCCT CAAAGCCAGC TTCCTGTAGA 240 

GGCTGAAGGC AAAGTCAGTA TTGATGGAAG CCAGTGTGAG ACTGTCCAGT TGTGTCCCCT 300 

TGTCTTGGTC TTCATGGAAT AGAGTGTCCT CTCCCAGTAT GCAAATCTGG GAAGCAGAGG 360 

AGAGCAGAGC CCGATCGATC AAGATGACTA AGATGCTCAA AGGATTCGAC GCTGTGGGAA 420 

ATGCCACAGG TTTCCGGTTC GCCTACACCC CAGCTATGGA GAGCCTCTGT GGATATGTCA 480 

CAAGTCCAGA ACCGCAGCGA GGAGTTTCTC ATCGCGGGCC GTTTAAGGAC G 531 

(2) INFORMATION FOR SEQ ID NO: 96: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 274 base pairs 
<B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 

(vii) IMMEDIATE SOURCE: 
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(B) CLONE: SAC_23836 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 96: 



GGGGGATCCG CCCGATCATG ATGAATCCCC TGGGACCAAG CTATAACACA CAGTCCTCTG 



60 



AGAAAAAGGA TTGTGGTTTG CAGCAAATTC AGCCAATTTT GTCAAGCAAT TTGTGAGAGA 



120 



CACTCCTTAC TTGGTTGTTT CCTTCTCAGT CTTACGGGGT GCCTTATCAC TCCGTGATGG 



180 



TTCATGGCCA CACAGTCATA TTTCAGGGAG AAGTCCTTGT CGGTCACACC AGTTATCCTT 



240 



AACAGTGAGG TTAAGCAAAT CATGCCATTG CTGG 



274 



(2) INFORMATION FOR SEQ ID NO: 97: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 254 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 



(vii) IMMEDIATE SOURCE: 

(B) CLONE: SAC_23853 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 97: 

GGGGGATCCG CCCGATCGTG TTATTGAGGT CATCCACGGT GGAGATTAGC AGGTGACTCC 60 

GTTTTCCTGG AAATGAGTGG AGTTGGGAAG GGGACCCGTG GCCGTGCTCC AGCTATAGTT 120 

GGTGGGCGGT GGTTCGCTGC GAGCTTCACA GGTCAGGTTC ACGTTAGTAG GCCCAATGTA 180 

CCAGTTGCCT TCATAGCCAG AGATGGACAC TTCGGGTGGA TAAGGTAGGG AAAGGATCGG 240 

GCTAGAGCGG CCGC 254 
(2) INFORMATION FOR SEQ ID NO: 98: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 255 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 



(vii) IMMEDIATE SOURCE: 

(B) CLONE: SAC_23906 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 98: 

GGGGGATCCG CCCACAGAAC CCAGAGGAAG 6AGAGGCTGC TGGGGTGGAG GCCTAGGCGC 60 

TGGAGACATG TGGAGTTCTC TAGGGGTCTG CAGCAACCTC GGAAAGCTGG GAGATTCCCT 120 

CCTTGAGACT CCTACATATA GAAAACTGAT GCTTCTGTCT CATTCCATGC GGCTTTTCCT 180 

GCGGTATTCC TGTAGCGCTT TCTCTGCCAC TGTGTCCATA AACTTAGGGT TATCCTTGGA 240 

GACTTCTTCT GGTAC 255 
(2) INFORMATION FOR SEQ ID NO: 99: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 219 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
{iii) HYPOTHETICAL: NO 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: SAC_23907 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 99: 

GGGGGATCCG CCCACCCCAT CTAATCCTCT GAGGCTCAGG GTGACTCATT CTTCCTTGGA 60 

CCCAGGGGCG GGGCCGACAT TTGTTCCCCC AGTTGGGCCC TGCTGCCCCA GGCCAGTGGC 120 

ATTCCCAGTG TCTCCAGCCA CTAAAGCCAC ATTCCTCAGG TAGTTGGTAT TGAAGCAGTT 180 

GGCTTGTTTA TCTCCAGGAG ACAAGGTTCG AGCGGCCGC 219 
(2) INFORMATION FOR SEQ ID NO: 100: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 182 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: SAC_23916 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 100: 
GGGGGATCCG CCCTCGAGCG GCCGCCCACA AGAGATTGAA CCTAGAGCTG TGTGCCTCCT 60 
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GGGCAAGCAC TCTCTACTGA GCTACATCCC TGTTAAAGTG CCTTTTTTGG GAGCTTTGTC 120 
TTCCAGCCTG CCAATCAACC CACTTTATGG GTGTGCCTAG ATTCCCCTTT CTCTGAGTAG 180 
GG 

(2) INFORMATION FOR SEQ ID NO: 101: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 191 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: SAC_23922 

<xi) SEQUENCE DESCRIPTION : SEQ ID NO: 101: 
GGGGGATCCG CCCACTGTTC TCAAACAGTT GCCTAACAAT ACATTTAATT AAGAAATTAT 
CATAACTGTA TGCTTACACA TATGTGGATA GGCATCAATG AAGATTACAA CAGCTCAGCC 
ACAGTGCTGC AATGCTCTAC ACTACTACAA AACACCCACC ACGGGCCTGN NACCTGCCCT 
CGAGCGGCCG C 

(2) INFORMATION FOR SEQ ID NO: 102: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 234 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: SAC_24030 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 102: 
GGGGGATCCG CCCACCATAT CTATACGGCC CTGGAAGAGG AGATAGAGCG AAACAAGCAG 
AACCCAGTCT ATGCCCCGCT CTACTTNCCT GAGGAGCTGC ACCGAAGGCT GCCCTAGAGC 
AGGACATGGC CTTCTGGTAT GGGCCCCACT GGCAGGAGGC CATCCCTTAN NNACCAGCCA 
CACAGCACTA CGTAAAGCGT CTCCACGAGG TGGGAGGTGG GCTAGAGCGG CCGC 
(2) INFORMATION FOR SEQ ID NO: 103: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 172 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 



(vii) IMMEDIATE SOURCE: 

<B) CLONE: SAC_24078 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 103: 
GGGGGATCCG CCCGATCAAG ATGACTAAGA TGCTCAAAGG ATTCGACGCT GTGGGAAATG 60 
CCACAGGTTT CCGGTTCGCC TACACCCCAG CCATGAGAGC CTCTGTGGAT ATGTCCACAA 120 
GTCCCAGAAC CCGCAGCGAG GAGTTTCTCA TCGCGGGCCG TTTAAGGAAC GG 172 
(2) INFORMATION FOR SEQ ID NO: 104: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 241 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 



(vii) IMMEDIATE SOURCE: 

(B) CLONE: SAC_24105 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 104: 

GGGGGATCCG CCCGATCTGT GCATGTGAGC CGAACAGAAG TCAGCAATAA CCATGTCTTG 60 

ATTTACCTGG ATAAGGTGTC AAATCAGACG GTGAACTTGT CCTTCACGGC TCAGCAAGAT 120 

ATTCCAATAA GGGACCTGAA GCCAGCCGTA GTGAAAGTCT ACGATTACTA TGAGAAAGAT 180 

GAGTTTGCAG TTGCAAAATA CAGCGCTCCC TGCAGCACAG ATTATGGAAA TGCCTGAGGA 240 

C 241 
(2) INFORMATION FOR SEQ ID NO: 105: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 234 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: SAC_24140 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 105: 

GGGGGATCCG CCCGATCCTT AAAGGTCCCA TAACCAAGTA CTCCATGCAG TCATCTCCTC 60 

TAACCTGTGG GTCCCCTCCA TTCATTGCAA GCTGCTGGAC ATAGCCTGCT GGGTCCACCA 120 

CAAGTACAAC AGTGACAAGT CCAGCACCTA TGTGAAGAAT GGCACATCCT TCGACATCCA 180 

CTACGGCTCA GGTAGCCTCT CTGGGTACCT GAGCCAGGAC ACTGTGTTGG TTCC 234 
(2) INFORMATION FOR SEQ ID NO: 106: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 217 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: SAC_24142 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 106: 

GGGGGATCCG CCCGATCTAT CCGGGTATAT TCGGAACTGT GGAGTCTGAC CGCAGTGCAC 60 

TGCTACTCAG CTCACTCTGT CTCTTTCTCA CTGTGGGAGT AGCTATGGGA AAGGTCACAC 120 

TTCCTTGCTT TCCCCTCAAA CCAACAGGGC TTTCACTTCT CAGTCTCCCA ACGCACTGCA 180 

CACTGCTCCC AGCATCCCGC ACAGTCTCAC CACACAC 217 
(2) INFORMATION FOR SEQ ID NO: 107: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 172 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: SAC_24192 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 107: 
GGGGGATCCG CCCACATCTC AGCGGTGGGG ACTCAGACAT TCCTGTGTTC CCTCCTTGGC 
CTCCAGCCTC TCTGTAGGAA CCTCCAGCAG CCTGCCACCA GATTTCCCTT AGCTTCCACT 
GTCTCCATGA GCTTTAAAAT GTGGGCGGCC GCTCGAGGGC TAGAGCGGCC GC 
(2) INFORMATION FOR SEQ ID NO: 108: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 212 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: SAC_24296 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 108: 
GGGGGATCCG CCCACTTGGC TGACCGCCAG GCGAGTGACT GTGACTCCAG GTTCGCCCAC 
TACCGCAGCA GATACTGCCT CGCCTATGCC ACCTTCATAG TAGTGGTCCT CCACGGTGAG 
GATCCTGCCT TTGGTTGCTC TGGCACAGTC GAGAATGAGC TTTTTGTCCA GGGGCTTGAT 
GGTGAAGGGG TCCAGTGGGC TAGAGCGGCC GC 
(2) INFORMATION FOR SEQ ID NO: 109: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 210 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: SAC_24445 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 109: 
GGGGGATCCG CCCACAGCAT TCGGGTATGA GGCAAGGGGA GTGTTCATTC ACACACACAG 60 
TAGCTTCGGG TCTGTGAGGT CTCCTTGTTC CCGGGTAGGT TTCTAAAGAC GGAGAAAAAA 120 
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ATGATTCTGG TTATCAAGAC TACTGTGACC GTATTAGATC CCAGAATGGG CAAGCATCAG 180 
TGTGTGACCA TGCGAACAAA AGGAATTTTT 210 
(2) INFORMATION FOR SEQ ID NO: 110: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 200 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: SAC_24527 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 110: 

GGGGGATCCG CCCACCATGG TCTCACTCAG AGACGGGTTT GGTCAGTTTT AAGATGAACT 60 

CCTTCGGATG TAGTGTCCGT AAGTCCTTAT TATCGCTGTC CATCAATTCA CGATAGGATA 120 

GCTGGCTGCG GTGGCAAGTC CGCAGTGGTT GTTCCACCTG CCCACATTAT TCTCTCGTGG 180 

CCATGGTTCT TCCCTTAGAG 200 
(2) INFORMATION FOR SEQ ID NO: 111: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 400 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: SAC_24540 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 111: 

GGGGGATCCG CCCACAGTGG GAAGGATGGA AATAGCTGTC ATCTCTCCAA AACTGAGTTC 60 

CTTTCCTTCA TGAACACGGA GCTGGCCGCC TTCACGAAGA ACCAGAAGGA CCCCGGTGTC 120 

CTCGACCGCA TGATGAAGAA GCTGGACCTC AACAGTGATG GGCAGCTAGA TTTCCAAGAG 180 

TTTCTCAACC TTATTGGTGG CTTAGCTATA GCACCTGCCC ACTCTCGTGG CTGGCCCTCT 240 

GCTTATACTC CTTGGACTGC TCCAGGCTGT GTGTTTCCAC GCTTGACCTG CCCACAGAGC 300 

CAAAAGAGAC ACATTGGCTA CTTTAACCAC CTTAAAGCGG ACTCCAGGAA TATCACCCAC 360 
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GGCATGGCCT TTTGCGAACA AATCCCANCA ACCAAAAAAA 400 
<2) INFORMATION FOR SEQ ID NO: 112: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 200 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: SAC_24623 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 112: 

GGGGGATCCG CCCACAGTGC ATATTGGCGG CGCTCGCCTC ATTACGATTC GCCAGCTTGC 60 

TTCTCTTGTT CAATTGTTTC TTTTGAAGGC AGAGGATTTT TCTCTTGTGT TTCTGTCTTC 120 

TTCAACTTCG ACTTATCGAA TTTCACCTGC CCACTAAACC CACAGCCCAG CCCTCCTTCC 180 

TTCCTCCTGT CCTGGAATTG 200 
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What is claimed is: 

1 . Purified and isolated nucleic acid encoding all or a unique fragment of a KIM. 

2. Nucleic acid of claim 1 that is RNA. 

3. Nucleic acid of claim 1 that is DNA. 
5 4. Nucleic acid of claim 3 that is cDNA. 

5. Nucleic acid that is complementary to the nucleic acid of claim 1. 

6. Nucleic acid that hybridizes to the nucleic acid of claim 1 or 5. 

7. Nucleic acid of claim 1 that is in operative association with an expression control 
element therefor. 

10 8. Antisense nucleic acid of claim 1 that is sufficient, when internalized within a cell, 
to disrupt expression of a cellular KIM gene. 

9. Purified and isolated nucleic acid encoding all or a unique fragment of a KIM, said 
nucleic acid having a sequence selected from the sequences listed in TABLE 1. 

10. Purified and isolated nucleic acid encoding all or a unique fragment of a KIM, said 
15 nucleic acid having a sequence selected from degenerate variants of the sequences 

listed in TABLE 1. 

1 1. Purified and isolated nucleic acid encoding all or a unique fragment of a KIM, said 
nucleic acid having a sequence selected from homologous variants of the sequences 
listed in TABLE 1. 

20 12. Purified and isolated nucleic acid encoding a KIM fusion polypeptide, the sequence 
of said nucleic acid comprising a KIM sequence selected from the sequences listed 
in TABLE 1 fused in frame to a sequence encoding a non- KIM polypeptide. 

13. Nucleic acid of claim 12 wherein the encoded non-KIM polypeptide is selected 
from: a secretable leader polypeptide, an immunoglobulin polypeptide, a binding 
25 pair partner, a toxin or toxoid, an enzyme, and a detectable polypeptide. 
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14. A vector comprising a KIM-encoding nucleic acid of any one of claims 1 to 13. 

15. A host cell stably transformed or transfected with a vector of claim 14. 

16. A method for producing a KIM polypeptide, comprising the steps of growing a host 
cell of claim 15 under conditions sufficient for the production of polypeptides of 

5 vector origin; and, recovering an expressed KIM polypeptide. 

17. A purified and isolated polypeptide comprising all or a unique fragment of a KIM, 
said polypeptide having a sequence selected from the sequences encoded by nucleic 
acids listed in TABLE 1. 

18. A purified and isolated polypeptide comprising all or a unique fragment of a KIM, 
10 said polypeptide having a sequence selected from variants of the sequences encoded 

by nucleic acids listed in TABLE 1 . 

19. A polypeptide of claim 18 wherein said variant is a splice variant, truncation 
variant, or substitution variant of an encoded TABLE 1 sequence. 

20. A KIM fusion polypeptide, the sequence of said polypeptide comprising all or a 
15 unique fragment of an encoded KIM sequence listed in TABLE 1 , fused to a non- 

KIM polypeptide sequence. 

21 . A fusion polypeptide of claim 20 wherein the non-KIM polypeptide sequence is 
selected from: a secretable leader polypeptide sequence, an immunoglobulin 
polypeptide sequence, a binding pair partner sequence, a toxin or toxoid sequence, 

20 an enzyme sequence, and a detectable polypeptide sequence. 

22. A conjugate comprising a KIM polypeptide of claim 17, 18 or 19, linked to a 
detectable moiety, imageable moiety, radiolabeled moiety, or toxin moiety. 

23. A KIM fusion polypeptide, the sequence of said polypeptide comprising all or a 
unique fragment of an encoded KIM sequence selected from nucleic acid sequences 

25 listed in TABLE 2, fused to a non-KIM polypeptide sequence. 

24. A fusion polypeptide of claim 23 wherein the non-KIM polypeptide sequence is 
selected from: a secretable leader polypeptide sequence, an immunoglobulin 
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polypeptide sequence, a binding pair partner sequence, a toxin or toxoid sequence, 
an enzyme sequence, and a detectable polypeptide sequence. 

25. A conjugate comprising a KIM polypeptide of claim 23 or 24, linked to a 
detectable moiety, imageable moiety, radiolabeled moiety, or toxin moiety. 

5 26. An antibody that binds selectively to all or a unique fragment of a KIM polypeptide 
encoded by a nucleic acid sequence listed in TABLE 1 . 

27. A conjugate comprising an antibody of claim 26, linked to a detectable moiety, 
imageable moiety, radiolabeled moiety, or toxin moiety. 

28. An antibody that binds selectively to all or a unique fragment of a KIM polypeptide 
10 encoded by a nucleic acid sequence listed in TABLE 2. 

29. A conjugate comprising an antibody of claim 29, linked to a detectable moiety, 
imageable moiety, radiolabeled moiety, or toxin moiety. 

30. Use in therapy of an antisense KIM nucleic acid of claim 8. 

3 1 . Use in therapy of a KIM vector of claim 14. 

15 32. Use in therapy of a KIM polypeptide of claim 17, 18 or 19. 

33. Use in therapy of a KIM fusion protein of claim 20 or 2 1 , or of a KIM conjugate of 
claim 22. 

34. Use in therapy of an antibody of claim 26, or of an antibody conjugate of claim 27. 

35. A pharmaceutical composition comprising a physiologically acceptable carrier 

20 having dispersed therein, to a therapeutically effective concentration: a KIM nucleic 

acid of claim 8; a KIM vector of claim 14; a KIM polypeptide of claim 17, 18 or 
19; a KIM fusion protein of claim 20 or 21; a KIM conjugate of claim 22; an anti- 
KIM antibody of claim 26; or, an anti-KIM antibody conjugate of claim 27. 

36. A method for detecting the presence or status of renal injury or of renal disease, 
25 comprising the step of measuring the concentration of a KIM polypeptide in a 
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sample selected from serum, urine or urine sediment of an individual afflicted with, 
or at risk of developing, renal injury or an impairment of renal function. 

37. A method for detecting the presence or status of an abnormal response to tissue 
injury or other stimulus, comprising the step of measuring the concentration of a 

5 KIM polypeptide in a sample selected from serum, urine or urine sediment of an 

individual afflicted with, or at risk of developing, an autoimmune response or 
abnormal tissue growth arising from or affecting renal tissue. 

38. A method according to claim 36 or 37, wherein said measuring step involves 
contacting said sample with an anti-KIM antibody of claim 26 or 28, or with a 

10 conjugate of claim 27 or 29. 

39. A kit for practice of the method of claim 36 or 37, comprising at least one reagent 
selected from: the anti-KIM antibody of claim 26; the anti-KIM antibody of claim 
28; the conjugate of claim 27; and, the conjugate of claim 29. 

40. A method for detecting the presence or status of renal injury or of renal disease, 
IS comprising the step of measuring the level of a KIM nucleic acid in a sample 

selected from serum, urine or urine sediment of an individual afflicted with, or at 
risk of developing, renal injury or an impairment of renal function. 

41. A method for detecting the presence or status of an abnormal response to tissue 
injury or other stimulus, comprising the step of measuring the level of a KIM 

20 nucleic acid in a sample selected from serum, urine or urine sediment of an 

individual afflicted with, or at risk of developing, an autoimmune response or 
abnormal tissue growth arising from or affecting renal tissue. 

42. A method according to claim 40 or 41 , wherein said measuring step involves 
contacting said sample with a KIM nucleic acid of claim 9, 10 or 1 1; or with a 

25 nucleic acid probe that hybridizes to KIM nucleic acid of claim 9, 10 or 1 1 ; or with 

a nucleic acid having a sequence selected from sequences listed in TABLE 2; or 
with a nucleic acid probe that hybridizes to a nucleic acid listed in TABLE 2. 
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43. A kit for practice of the method of claim 40 or 4 1 , comprising at least one reagent 
selected from: the KIM nucleic acid of claim 9, 10 or 1 1; or a nucleic acid probe 
that hybridizes to KIM nucleic acid of claim 9, 10 or 1 1 ; or a nucleic acid having a 
sequence selected from sequences listed in TABLE 2; or a nucleic acid probe that 
hybridizes to a nucleic acid listed in TABLE 2. 

method of imaging cells or tissue expressing or producing a KIM, comprising the 
3ps of contacting the cells or tissue with an imageable KIM-binding reagent; and, 
imaging an accumulation of the KIM-binding reagent. 

method according to claim 44 wherein the cells or tissue are disposed in body 
sue of an individual suspected of harboring an imageable locus of KIM 
expression and/or production. 

46. A method of locating a tissue mass abnormally producing or expressing a KIM in 
an individual suspected of affliction with an abnormal tissue response to injury or 
other stimulus, comprising the step of: administering an imageable KIM-binding 
reagent to the individual; and, imaging said tissue mass within the body of the 
individual. 

47. A method according to claim 44 wherein the tissue mass is a tumor arising from or 
affecting renal tissue of the individual. 

A method of treating an individual afflicted with, or at risk of developing, a disease 
or condition contributed to or associated with a dysfunction or dysregulation of a 
KIM gene or protein, comprising the step of administering to the individual a 
pharmaceutical composition of claim 35. 

49. A method of treating an individual afflicted with, or at risk of developing, a disease 
or condition beneficially affected by therapeutic administration of a KIM 
25 composition of the present invention, comprising the step of administering to the 

individual a pharmaceutical composition of claim 35. 



48. 

20 
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50. A method according to claim 48 or 49 wherein the disease is a renal disease; or, the 
condition is an impairment of renal function. 

51. A method of inhibiting the growth of KIM-expressing tumor cells, comprising the 
step of contacting the tumor cells with a pharmaceutical composition comprising 
involving the step of contacting the cells with: an antisense KIM nucleic acid of 
claim 8; a KIM fusion protein of claim 20, 21, 23 or 24; a KIM conjugate of claim 
22 or 25; an antibody of claim 26 or 28; or an antibody conjugate of claim 27 or 29. 

52. A method according to claim 36, 37, 40, 41 , 45, 46, 47, 48, 49, or 50 wherein the 
individual is a human. 
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